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(54) Tide: METHODS AND CONSTRUCTS FOR PRODUCING MALE STERILE PLANTS 
(57) Abstract 

A process is described for producing fenile hybrid seed or hybrid seed comprising fertile and sterile seed using male-sterile plants 
created by employing molecular techniques to manipulate genes that are capable of controlling the production of fertile pollen m plants. 
Hybrid seed production is simplified and improved by this approach, which can be extended to plant crop species for which commercially 
acceptable hybrid seed production methods have not been available. 
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Methods and constructs for producing male sterile plants 

TECHNICAL FIELD 

The present, invention relates to a method for 
producing male sterile plants and hybrid seed, to genetic 
5 material used to impart the male sterility trait and to 
new products produced by said method, namely, genetically 
transformed plants carrying the male sterile trait, male 
sterile plants and hybrid seed produced by pollinating 
said plants with pollen from male fertile plants. The 

10 present invention also relates to a method for producing 
a plant carrying t'e male sterile trait, or its seed, 
produced by pollinating said plant with pollen from 
genetically transformed male fertile plants that, by 
virtue of the genetic material used, is resistant to 

15 herbicides and, if so desirable, is restored in respect 
to fertility of the hybrid seed. 

BACKGROUND 

Production of hybrid seed for commercial sale is a 
large industry. Hybrid plants grown from hybrid seed 
20 benefit from the heterotic effects of crossing two 
genetically distinct breeding lines. The agronomic 
performance of this offspring is superior to both 
parents, typically in vigour, yield, and uniformity. The 
better performance of hybrid seed varieties compared to 

2 5 open-pollinated varieties makes the hybrid seed more 

attractive for farmers to plant and thereby commands a 
premium price in the market place. 

In order to produce hybrid seed uncontaminated with 
selfed seed, pollination control methods must be 

3 0 implemented to ensure cross -pollination and not self- 

pollination. Pollination control mechanisms can be 
mechanical, chemical, or genetic. 

A simple mechanical method for hybrid seed production 
can be used if the plant species in question has 
35 spatially separate male and female flowers or separate 
male and female plants. The corn plant, for example, has 
pollen producing male flowers in an inflorescence at the 
apex of the plant and female flowers in the axils of 
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leaves along the stem. Outcrossing is assured by 
mechanically de- tasselling female plants to prevent 
self ing . 

Most major crop plants of interest, however, have 
both functional male and female organs within the same 
flower so emasculation is not a simple procedure. It is 
possible to remove by. hand the pollen fozrming organs 
before pollen shed. But this form of hybrid seed 
production ■ is extremely labour intensive and expensive. 
Seed is produced in this manner if the value and amount 
of seed recovered warrants the effort. 

A second general method of producing hybrid seed is 
to use chemicals that kill or block viable pollen 
formation. These chemicals, termed gametocides, are used 
to impart a transitory male-sterility. Commercial 
production of hybrid seed by use of gametocides is 
limited by the expense and availability of the chemicals 
and the reliability and length of action of the 
applications. . These chemicals are not effective for 
crops with an extended flowering period because new 
flowers will be produced that will not be affected. 
Repeated application of chemicals is impractical because 
of costs . 

Many current commercial hybrid seed production 
systems for field crops rely on a genetic method of 
pollination control. Plants that are used as females 
either fail to make pollen, fail to shed pollen or 
produce pollen that is biochemically unable to effect 
self-fertilization. Plants that are unable (by any, or 
a combination, of several different means) to self 
pollinate biochemically are termed self -incompatible . 
Difficulties associated with the use of self- 
incompatibilities are: availability and propagation of 
the self -incompatible female line and stability of the 
self -incompatibility . In some instances self- 

incompatibility can be overcome chemically or immature 
buds can be pollinated by hand before the biochemical 
mechanism chat blocks pollen is activated. Self- 
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incompatible systems that can be deactivated are often 
very vulnerable to stressful climatic conditions that 
break or reduce the effectiveness of the biochemical 
block to self-pollination. 
5 Of more widespread interest for commercial seed 

production are systems of pollen control based on genetic 
mechanisms causing male sterility. These systems are of 
two general types: (a) genie male sterility, which is the 
failure of pollen formation because of one or more 

10 nuclear genes or (b) cytoplasmic-genetic male sterility 
(commonly called cytoplasmic male sterility or. CMS) in 
which pollen formation is blocked or aborted because of 
a defect in a cytoplasmic organelle (mitochondrion) (for 
general discussions on genie sterility, CMS and hybrid 

15 formation in plants see Frankel, R., et al.. Pollination 
Mechanisms, Reproduction and Plant Breeding ; Springer V. , 
et al . , Monographs on Theoretical and Applied Genetics , 
N.y, 1977; Edwardson, J. P., Bot. Rev, 36:341-420, 1970). 
Nuclear (genie) sterility can be either dominant or 

20 recessive. A dominant sterility can only be used for 
hybrid seed production if fertility of the hybrid plants 
is not critical, and if propagation of the female line is 
feasible, for example, by clonal propagation or by the 
use of a selectable marker closely linked to the 

25 sterility gene. 

Many successful hybridization schemes involve the use 
of CMS. In these systems, a specific mutation in the 
cytoplasmically located mitochondrion can, when in the. 
proper nuclear background, lead to the failure of mature 

3 0 pollen formation. In some other instances, the nuclear 
background can compensate for the cytoplasmic mutation 
and normal pollen formation occurs. The nuclear trait 
that allows pollen formation in plants with CMS 
mitochondria is called restoration and is the property of 

35 specific "restorer genes". Generally the use of CMS for 
commercial seed production involves the use of three 
breeding lines, the male-sterile line (female parent) • a 
maintainer line which is isogenic to the male-scerile 
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line but contains fully functional mitochondria and the 
male parent line. 

The male parent line may/carry the specific restorer 
genes (usually designated a restorer line) which then 
imparts fertility to the hybrid seed. For crops such as 
vegetables, for which seed recovery from the hybrid is 
unimportant, a CMS system could be used without 
restoration. For crops for which the fruit or seed of 
the hybrid is the commercial product then the fertility 
of the hybrid seed must be restored by specific restorer 
genes in the male parent or the male -sterile hybrid must 
be pollinated. Pollination of non-restored hybrids can 
be achieved by including with hybrids a small percentage 
of male fertile plants to effect pollination. In most 
species, the CMS trait is inherited maternally- (because 
all cytoplasmic organelles are inherited from the egg 
cell only) , which can restrict the use of the system. 

In a crop of particular interest herein, the oilseed 
crop of the , species Brassica napus or Brass ica 
campestris, commonly referred to as canola, no commercial 
hybrid system has been perfected to date. Mechanical 
emasculation of flowers is not practical for hybrid seed 
production on any scale. The use of currently available 
gametocides is impractical because of the indeterminate 
nature of flower production. Repeated application of 
chemicals is expensive and the method is prone to 
contamination with self ed seed. 

Genes that result in self -incompatibility are quite 
widespread in Brassica species and self -incompatible 
hybrid systems have been used for hybrid seed production 
in vegetables. Major difficulties are associated with 
the propagation of the female lines and the breakdown of 
self -incompatibilities under stressful conditions. 
Adaptation of these systems to Brassica oilseeds is 
restricted by the expense of increasing the female lines 
and the availability of appropriate self - incompatible 
genes in the dominant Canola species, Brassica napus. 
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A variety of sources of male sterility are available 
in Brassica species. Both recessive and dominant genie 
systems have been reported, however their use is 
restricted because large scale in vitro propagation or 
roguing of female lines is in most cases impractical for 
large scale seed production. 

Additionally, a number of CMS systems have been 
reported in Brassica species. Four of these systems have 
been explored as possible vehicles for hybrid seed 
production: pel, nap, anand and ogu . The Polima system 
(pol) has been widely studied and is probably the closest 
to commercial use. Good restoration and maintenance of 
pol CMS has been achieved, however the system suffers 
from potential instability of the CMS with high 
temperature, a reduction in the heterotic effect of 
crossing different lines (because of the defective 
mitochondria) and a reduction in hybrid seed oil content. 
The use of other CMS systems is also restricted by heat 
sensitivity (nap) , difficulty in restoration of fertility 
(ogu, anand) , difficulty in the maintenance of the 
sterility (nap) and low temperature chlorosis associated 
with the sterile cytoplasm (ogu) . Improvement of these 
systems is the object of considerable research, however 
all of the systems have some inherent weaknesses that 
limit their utility. 

For a general discussion of male sterility in 
Brassica see Shiga, T., Male Sterilitv and Cvtoplasmin 
Differentiation. In Brassica Crops and Wild Allea. 
Biology and Breedincf, Japan Scientific Societies Press, 
Tokyo pp. 205-221; Thompson, K.F., Heredity 29 :253-257) . 

It is recognized that a desirable system for hybrid 
seed production in any crop would be a form of genie male 
sterility that could be regulated or overcome to allow 
male fertility for the propagation or increase of the 
female lines or to allow fertility in hybrids. This 
recognition has stimulated research on the use of 
molecular systems to effect genie male sterility that 
could be used for hybrid seed formation. In addition. 
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the advent and widespread application of recombinant DNA 
techniques may provide a mechanism of introduction of 
novel DNA sequences into a wide variety of different crop 
species that is not possible by the limited sexual 
methods of genetic exchange between different species. 
A molecular approach has the advantage that the 
hybridization system can be imposed on all breeding lines 
or cultivars of any given crop without the need for 
extensive backcrossing and disruption of established 
inbred lines leading to the rapid production of male 
sterile lines with well characterized and superior 
agronomic performance . 
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SUMMARY OF THE INVENTION 

In accordance with one aspect of the present 
invention, a recombinant DNA molecule has been provided 
for use in preparing a male sterile plant of a pollen- 
5 bearing species that is capable of being transformed. 
The recombinant DNA molecule comprises 

(A) a first DNA sequence coding for a first gene 
product that converts a first substance which 
is present in the plant into a second 

10 substance, wherein the first and second 

substances do not inhibit pollen ' formation or 
function, 

(B) a second DNA sequence coding for a second gene 
product that converts the second substance to 

15 a third product that inhibits pollen formation 

or function, 

(C) a first promoter that controls expression of 
the first DNA sequence in plants, 

(D) a second promoter that controls expression of 

2 0 the second DNA sequence in plants, and 

(E) a third DNA sequence that confers resistance to 
a selective agent. 

In a preferred embodiment, the aforementioned resistance 
is to a herbicide, such as a herbicide selected from the 

25 group consisting of glyphosate, glyphosinate and TBL. 

In another preferred embodiment, at least one of the 
first and second promoters is specific to cells that are 
critical to pollen formation or function. In particular, 
either or both of the first and second promoters can be 

30 specific to anther cells or, alternatively, to 
microspores. In still another preferred embodiment, the 
second promoter is an inducible promoter. 

In a recombinant DNA molecule as described above, the 
first DNA sequence can be lamS and second DNA sequence 

3 5 lamH. Alternatively, the second substance and the second 

expression product can be paired as follows: 2-amino-4- 
methoxy-butanoic acid and methoxinine dehydrogenase or 
rhizobitoxine synthase; a non- toxic analog of glucuronic 
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acid and )S - glucuronidase ; napthalene acetamide or indole 
acetamide and indole acetamide hydrolase . 

In accordance with another aspect of the present 
invention, there is a provided a method for producing a 
male sterile plant, comprising the steps of 
(A) introducing a recombinant molecule as described above 
into the genome of a pollen producing plant which is 
capable of being genetically transformed; and (B) growing 
the pollen producing plant under conditions such that 
male sterility is achieved as a result of the expression 
of the first DNA sequence and the second DNA sequence, 
providing the male sterile plant. In a preferred 
embodiment, the method further comprises, after step (B) , 
the steps of (C) crossing the male sterile plant with an 
fertile, isogenic plant to obtain seed, then (D) growing 
out the seed to produce progeny plants, and (E) selecting 
male sterile plants from among the progeny by exposure to 
the selective agent. 

According .to yet another aspect of the present 
invention, a method is provided for producing hybrid seed 
from a male sterile plant, comprising the steps of 

(A) introducing a first recombinant molecule as 
described above into the genome of a pollen 
producing plant which is capable of being 
genetically transformed; 

(B) growing the pollen producing plant under conditions 
such that male sterility is achieved as a result of 
the expression of the first DNA sequence and the 
second DNA sequence, yielding a male sterile plant; 
and 

(C) crossing the male sterile plant with pollen from a 
male fertile line, the pollen having integrated into 
its genome a second recombinant DNA molecule 
comprising a selective marker gene and a promoter 
regulating expression of the selective marker gene 
in plants. 

Pursuant to a preferred embodiment of this method, 
the second recombinant DNA molecule further comprises 
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(i) a third DNA sequence and (ii) a third promoter that 
causes transcription of the third DNA sequence in plants, 
wherein the third DNA sequence is transcribed as an 
antisense RNA or encodes a gene product that negates the 
effect of the first DNA sequence or the second DNA 
sequence, such that hybrid seed with restored fertility 
is produced by the male sterile plant, and the third 
promoter causes the transcription such that expression of 
the third DNA sequence overlaps in time with expression 
of the first DNA sequence or the second DNA sequence. In 
the alternative, the third DNA sequence - can have an 
antisense RNA transcript, can be a repressor gene, or can 
encode a gene product that is a ribozyme, a dominant 
negative transact ivator, or a transdominator 
According to another preferred embodiment, the selective 
marker genes imparts herbicide resistance. 

Various products also come within the scope of the 
present invention. These include, for example, hybrid 
seed produced by a method described above, a plant cell 
comprising elements (A) through (E) of the aforementioned 
recombinant DNA molecule , and a differentiated plant 
comprising such a cell. 

The following advantages over other hybridization 
systems may be obtained with the present invention: 

(a) Hybrid seed production is not labour intensive 
and can be achieved on a large scale with commercially 
acceptable costs. 

(b) Male sterility is simply inherited and stable 
in response to environmental stresses that limit the 
effectiveness of self -incompatibility and CMS based 
schemes . 

(c) Seed that is produced will be relatively 
uncontaminated by selfed seed. 

(d) The system avoids the use of defective 
cytoplasmic organelles that may detract from the 
performance of hybrid seed. 

(e) The system will greatly speed the development 
and increase the number of lines that can be tested as 
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parents in a hybrid cross because it can be imposed on 
any plant or inbred line capable of being transformed and 
regenerated into plants without the inclusion of 
additional genomic DNA. Additionally plant lines can be 
tested for combining ability before inclusion of the 
hybridizing system which can modify breeding strategy. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a representation of the construction of 
an anti- sense gene vector that was used for ant i- sense 
RNi^ inhibition of iS- glucuronidase gene activity in 
transgenic plants. 

Figure 2a is a schematic representation of a 
restriction map and coding region of clone number L 4, a 
microspore specific clone isolated from a Brassica napus 
genomic library. The clone contains three similar copies 
of a single gene. These genes are identified as Bp4A, 
Bp4B and Bp4C. The first (Bp4A) and third (Bp4C) genes 
are functional, the second gene has modifications that 
most likely render it non-functional. The restriction 
map is made diagrammatically in that the non-transcribed 
regions are shown as a single line, while the transcribed 
regions are shown as a boxed area. The second gene 
(Bp4B) is identified on the basis of sequence homology 
and is shown as a boxed area with a dotted line. The 
notation "del 220" refers to an approximately 22 0 base 
pair deletion/rearrangement that has presumably 
inactivated the second gene (Bp4B) in this clone. Start 
of transcription is located at the leftmost side of each 
boxed area (except in the case of gene Bp4B) and exon and 
intron positions are noted by the exons being filled in 
with black and the intron positions being left unfilled. 
A small arrowhead is shown on the non- transcribed 5' 
region of each gene, this arrowhead serves to indicate 
the promoter region of each gene. Restriction sites are 
identified such that the number of the first nucleotide 
of the restriction enzyme recognition sice xs shown. Not 
all restriction sites are shown, only those relevant to 
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the constructs detailed within are indicated. The genes 
are presented with the 5' region on the left side and the 
3' region on the right side. The numeration of the DNA 
sequence is from left to right, 5' to 3 ' in all cases. 

Figure 2b is a schematic representation of a 
restriction map and coding region of clone number L 10, 
a microspore specific clone isolated from a Brassica 
napus genomic library. The clone contains a single gene. 
Start of transcription, exon, intron and promoter 
positions are noted as in Figure 2a. Restriction sites 
are identified such that the number of the first 
nucleotide of the restriction enzyme recognition site is 
shown. The genes are presented with the 5' region on the 
left side and the 3' region on the right side. The 
numeration of the DNA sequence is from left to right, 5' 
to 3' in all cases. 

Figure 2c is a schematic representation of the 
restriction map and coding region of clone number L 16, 
a microspore specific clone isolated from a Brassica 
napus genomic library. The clone contains a single gene 
that shows similarity to clone L 10. Intron and exon 
positions are noted as in Figure 1.. Restriction sites 
are identified such that the number of the. first 
nucleotide of the restriction enzyme recognition site is 
shown. The genes are presented with the 5' region on the 
left side and the 3' region on the right .side. The 
numeration of the DNA sequence is from left to right, 5' 
to 3' in all cases. 

Figure 2d is a schematic representation of the 
restriction map and coding region of clone number L 19, 
a microspore specific clone isolated from a Brassica 
napus genomic library. The clone contains a single gene. 
^^^^^ of transcription, exon, intron and promoter 
positions are noted as in Figure 1, Restriction sites 
are identified such that the number of the first 
nucleotide of the restriction enzyme recognition site is 
shown. The genes are presenced with che 5' region on the 
left side and the 3- region on the right side. The 
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numeration of the DNA sequence is from left to right, 5' 
to 3' in all cases. 

Figure 3a is the complete nucleotide sequence of the 
clone L 4 represented in Figure 2a. Only the coding 
5 strand is shown, for clarity. 

Figure 3b is the nucleotide sequence of the portion 
of the clone L 10 shown in Figure as being underlined in 
Figure 2b. Only the coding strand is shown, for clarity. 
Figure 3c is the nucleotide sequence of the portion 

10 of the clone L IS shown in Figure as being underlined in 
Figure 2c. Only the coding strand is shown, for clarity. 

Figure 3d is the nucleotide sequence of the portion 
of the clone L 19 shown in Figure as being underlined in 
Figure 2d. Only the coding strand is shown, for clarity. 

15 Figure 4 is the nucleotide sequence of 3 cDNA clones 

isolated from a microspore derived cDNA library of 
Brass ica napus . These clones are named cBp4 0i, cBp4 05, 
and cBp408. These three cDNA clones are extremely 
homologous to members of the L4 Brassica napus microspore 

20 specific gene family {Bp4A, Bp4B, Bp4C) . The nucleotide 
sequence of two of these 3 members of the L4 Brassica 
napus microspore specific gene family are shown in this 
Figure (Bp4A, Bp4C) . The gene Bp4C was chosen as a master 
sequence for comparison. The deduced nucleotide coding 

25 sequence for the genes Bp4A and Bp4C is shown as a 
sequence from which the two exons of the genes have been 
spliced together at the positions normally spliced in 
vivo. This gives rise to the coding sequence in the 
mature mRNA. The cDNA clones are aligned with the 

3 0 sequence of Bp4C such that only nucleotide changes are 
shown. The sequences are therefore represented as 
variants of a single master sequence of gene Bp4C which 
is shown on line 1. The ATG start codon as well as the 
TGA or TAA stop codons are underlined. These three cDNA 

35 clones correspond to related members of the Brassica 
napus microspore specific gene family a portion of which 
is contained in the clone L4 . 
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Figure 5 shows a partial nucleotide sequence of a 
cDNA clone that is closely homologous . to the • gene 
contained in the clone LIO, the restriction map of which 
is shown in Figure 2b, 

Figure 6 is the nucleotide sequence of the cDNA clone 
that is the gene product of clone L19/ the restriction 
map of which is shown in Figure 2d. 

Figure 7 (7A, 7B, 7C, 7D) are schematic representa- 
tions describing the production of vectors containing the 
promoter and promoter regions from clone L4 . The 
specific examples are discussed in greater detail below. 
Figure 7E is a schematic representation of the promoter 
constructs produced as shown schematically in Figures 7A 
to 7D. 

Figure 8 is a schematic representation describing the 
production of vectors containing the promoter regions of 
clone LIO, the details of which are discussed below. 

Figure 9 is a schematic representation describing the 
production of vectors containing the promoter regions of 
clone L19, the details of which are discussed below. 

Figure 10 is a schematic representation of a 
restriction map of a Brassica napus genomic clone that 
contains a gene (referred to as HPlOl) that is 
constitutively expressed at high levels in all cells 
including developing pollen cells. The portion of the 
clone that was used to provide promoter regions for the 
production of anti-sense RNA is shown, this construct 
gives rise to an anti- sense RNA that contains a region of 
transcribed RNA from this gene. 

Figure 11 is a schematic representation describing 
the production of a gene coding for a polylysine protein. 

Figure 12 is a schematic representation describing 
the production of an anti-sense gene specific to the 
intron region of clone L19 and a restorer that lacks the 
intron region targeted for anti-sense RNA inhibition. 

Figure 13 is a schematic representation describing 
the production of clones containing versions of a ricm 
A chain coding region. 
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Figure 14 is a histogram showing GUS activity in 
plants transformed with sense and anti-sense GUS genes. 

Figure 15 illustrates the procedure used for the 
isolation of the T-DNA gene 2 (the lamH: indole acetamide 
5 hydrolase gene) of the Agrobacterium tumefaciens Ti 
plasmid derivative pPCV 311 (described by Koncz, C, and 
Schell, J., Molecular and General Genetics, 1986, 
204 : 383-3 96) and the construction of a promoterless 
version of this gene. 

10 Figure 16 illustrates the procedure used for the 

isolation of the T-DNA gene 1 (the lamS: indole acetamide 
synthase gene) of the Agrobacterium tumefaciens Ti 
plasmid derivative pPCV 311 and the construction of a 
promoterless version of this gene . 

15 Figure 17 illustrates hybrid seed production using 

the binary cryptocytotoxicity method based on the use of 
the lamH and lamS genes described in Figures 15 and 16. 

Figure 18 illustrates the segregation patterns of the 
lamH and lamS genes in the Fl and F2 populations when the 

20 genes are on the same segregation unit. 

Figure 19 is ,an alternate representation of the 
procedure used to isolate the T-DNA gene 2 (the lamH 
gene) of Agrobacterium tumefaciens and the production of 
a promoterless version of this gene. 

25 Figure 20 illustrates hybrid seed production using 

a variation of the binary cryptotoxicity method and a 
herbicide resistance gene. Genes 1 and 2 can be the lamS 
and lamH genes . The herbicide resistance gene can be any 
resistance gene against any herbicide. The herbicide 

30 resistance gene has a dual role: it results into a hybrid 
plant and seed protected from herbicide treatment and is 
essential for selection of parent A when the parent is 
maintained via a cross with parent A' . The resulting 
hybrid seed is essentially 100% herbicide resistant and 

35 restored in respect to male fertility. 

Figure 21 illustrates hybrid seed production with 
parent A being produced and maintained as shown in 
Figure 20. Parent B is homozygous to herbicide 



wo 96/40950 



- 15 - 



PCT/US96/08692 



resistance. The hybrid is essentially 100% herbicide 
resistant. 

Figure 22a illustrates the use of the binary 
cryptotoxicity method for production of a male sterile, 
herbicide resistant parent A. The resulting parent A is 
essentially 100% male sterile with Gl and G2 on different 
chromosomes. Parent A is herbicide resistant. 

Figure 22b illustrates the method used to maintain 
parent A shown in Figure 22a. Notice use of herbicide 
spraying to maintain the male sterile parent A. 

Figure 22c illustrates the crossing of parent A shown 
in Figure 22a with parent B. Parent B is the result of 
a transformation where the restorer gene and the 
herbicide resistance gene are linked on one recombinant 
DNA molecule. The result of crossing parent A and parent 
B is essentially a hybrid that is restored in regard to 
male fertility and is herbicide resistant. 

Figure 23 is a map of plasmid pPHP5838 which contains 
the BplO promoter operably linked to the JAMS gene and 
the nos 3' nontranslated region. 

Figure 24 is a map of plasmid pPHP58 3 9 which contains 
the BplO promoter operably linked to the lAMH gene and 
the nos 3' nontranslated region . 

Figure 25 is a map of plasmid pPHP5840 which is the 
TA39(8B3) promoter and the 0' nontranslated leader 
operably linked to the JAMS gene and the PI -II 3' 
nontranslated region. 

Figure 26 is a map of plasmid pPHP5841 which contains 
the TA3 9(8B3) promoter and nontranslated leader operably 
linked to the lAMH gene and the PI- II 3' nontranslated 
region. 

Figure 27 is a partial sequence of pPHP5838, where 
the nucleotide coordinates correspond to those used in 
the map of Figure 23. A few nucleotides are presented in 
small letters to indicate uncertainty at that point in 
the sequence. 

Figure 28 is a partial sequence of pPHP583 9, where 
the nucleotide coordinates correspond to chose used m 
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the map of Figure 24 . A few nucleotides are presented in 
small letters to indicate uncertainty at that point in 
the sequence . 

Figure 29 is a partial sequence of pPHP5840, where 
5 the nucleotide coordinates correspond to those used in 
the map of Figure 25. A few nucleotides are presented in 
small letters to indicate uncertainty at that point in 
the sequence . 

Figure 30 is a partial sequence of pPHP5841, where 
10 the nucleotide coordinates correspond to those used in 
the map of Figure 2r: . A few nucleotides are presented in 
small letters to indicate uncertainty at that point in 
the sequence , 

DETAILED DESCRIPTION OF THE INVENTION 
15 As hereinbefore mentioned, the invention relates to 

recombinant DNA molecule (s) for use in the preparation of 
a plant or seed that are male sterile or carry a male 
sterile trait . The male sterile plant is created in 
order to facilitate hybrid crosses. The invention also 

2 0 relates to use of recombinant DNA molecule (s) introduced 

in the male fertile parent of such a cross, resulting in 
a hybrid seed restored in respect to fertility. 
The recombinant DNA molecule (s) for use to produce male 
sterility contains one or more DNA sequences which may be 
25 the same or different, which encode gene product (s) . The 
gene product (s) include a nucleotide sequence (in 
particular a RNA sequence), a protein or a polypeptide. 
When produced in a cell and/or tissue of a plant which is 
essential to pollen formation and/or function, the gene 

3 0 product is directly or indirectly capable of interfering 

with the function and/or development of the cell. The 
presence of the recombinant DNA molecule can be selected 
if the DNA also carries a selective marker gene, 
preferably for herbicide resistance. A cell and/or 
35 tissue of a plant which is essential to pollen formation 
and/or function includes cells and/or tissues that are 
instrumental in the development or function of pollen, 
including cells and/or tissues from which pollen develops 
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(e.gr., premeiotic and uninucleate microspore cells), 
cells and/or tissues which form part of the . male 
structure in which pollen develops (e.g., anther, tapetum 
or filament) and pollen itself. The recombinant DNA 
molecule may also contain- one or more promoters, which 
may be the same or different, which are capable of 
regulating the expression of the DNA sequences' in the 
recombinant DNA molecule. The DNA sequences and 
promoters in the recombinant DNA molecule are selected 
such that the gene product selectively interferes with 
the function and/or development of -a cell of a plant that' 
is essential to pollen formation and/or function. 

The recombinant DNA molecule of the invention may 
contain one or more DNA sequences which may be the same 
or different. The DNA sequences may be selected from the 
following DNA sequences: 

(a) a DNA sequence which encodes an RNA which when 
produced in a cell of a plant which is 
essential to pollen formation and/or function 
substantially interferes with the expression 
of; 

(i) a gene which is essential to pollen 
formation and/or function; 

(ii) a gene which is essential to the 
continued development and/or function 
of all metabolically competent cells 
of a plant; or 

(iii) a gene which confers on a cells of a 
plant resistance to a chemical agent 
or physiological stress; 

(b) a DNA sequence which encodes a protein or 
polypeptide which when produced in a plant cell 
which is essential to pollen formation and/or 
function is substantially cytotoxic to the 
cell; and 

(c) a DNA sequence which encodes a protein or 
polypeptide which when produced in a plant cell 
which is essential to pollen formation or 
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function renders a non-toxic substance 
substantially cytotoxic to the cell; and . 

(d) a DNA sequence which encodes a gene product 
which when produced in a plant cell which is 
essential to pollen formation and/or function 
renders the cell susceptible to a chemical 
agent or physiological stress. 

(e) a DNA sequence which encodes a gene product 
which may be converted into a substance which 
is cytotoxic to cells/tissues essential for 
pollen formation and/or function. 

The above DNA sequences (a) to (e) are further 
described below in relation to the various embodiments of 
the invention. 

According to a first embodiment of the present 
invention genie male sterility may be produced by 
transforming plant cells that are capable of regeneration 
into a differentiated whole plant, with a recombinant DNA 
molecule containing an anti-sense gene which encodes a 
gene product, in particular RNA, which is capable of 
interfering with the expression of a targeted gene. 
Preferably, the recombinant DNA molecule encodes RNA 
which is complementairy to and capable of hybridizing with 
the RNA encoded by the targeted gene. 

An anti- sense gene is a DNA sequence produced when 
a gene is inverted relative to its normal presentation 
for transcription. An anti -sense gene may be constructed 
in a number of different ways, provided that it is 
capable of interfering with the expression of a targeted 
gene. Preferably, the anti-sense gene is constructed by 
inverting the coding region of a gene relative to its 
normal presentation for transcription to allow for 
transcription of its complement; hence the RNA's encoded 
by the anti -sense and the targeted gene are 
complementary. It is understood that a portion of an 
anti -sense gene incorporated in the recombinant DNA 
molecule of the invention may be sufficient to 
selectively interfere with the expression of a gene and 
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thus the term "anti-sense gene" used herein encompasses 
a functional portion of an anti-sense gene. 

The gene targeted for inactivation is a gene 
expressed in cells and/or tissues of a plant that are 
essential to pollen formation and/or function. 
Preferably, the gene is expressed only at specific stages 
during pollen development whose regulation is tightly 
controlled. The gene may also be a gene that is 
essential for the continued development or function of 
all metabolically competent cells/tissues such as but not 
lim.ited to genes involved in essential cellular 
structures, and essential metabolism including essential 
biosynthesis. The gene further may be a gene which 
confers on cells/tissues of a plant resistance to a 
chemical agent or physiological stress . 

It is understood herein that it is not essential to 
identify and isolate the target gene de novo. Rather, 
the gene may have been described in the literature or 
obtained commercially. 

The isolation of genes that are essential to pollen 
formation and/or function may be accomplished by a 
variety of procedures. A detailed description of a 
procedure for isolating such genes is set out below. 

Since the gene in question may initially be isolated 
in its native form, it is' to be understood that the term 
"gene" as used herein may refer to one or more parts of 
the gene including 5' untranslated leader sequences, 
sequences functional in ribosome attachment, coding 
sequences, promoter sequences, intron sequences, and 
untranscribed 3' sequences, or any substantial fragments 
of these sequences . 

The present inventors have isolated DNA sequences 
from a plant of the species Brassica napus ssp oleifera 
w Westar which are expressed only in microspores and 
whose expression is essential to microspore function 
and/or development and hence a preferred embodiment of 
the present invention provides a recombinant DNA molecule 
containing one or more anti-sense genes to these isolated 
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DNA sequences. A schematic representation of the 
restriction maps and coding regions of the microspore 
specific genes identified as L4 , LIO, LIS and L19 are 
shown in Figures 2a, 2b, 2c, and 2d, respectively. The 
5 complete nucleotide sequence of clones L4 , and relevant 
sequences of LIO, L16 and L19 are shown in Figures 3a, 
3b, 3c, and 3d, respectively. The nucleotide sequences 
of isolated cDNA clones that correspond to the genes or 
related gene family members within clones L4, LIO and L19 

10 are shown in Figures 4, 5, and 6, respectively- In a 
preferred embodiment of the invention the recombinant DNA 
molecule contains one or more of an anti-sense gene to 
the microspore specific genes identified as LIO and L19 . 
It is believed that homologous microspore specific 

15 DNA sequences to those isolated from Brassica napus 
described herein will be found and expressed exclusively 
in pollen of other species of pollen-bearing plants, 
particularly species of plants within the genus Brassica 
and the family Cruciferae (also known as Brassicaceae) , 

20 and more particularly other cultivars of Brassica napus. 

The occurrence of the microspore specific sequences 
isolated from Brassica napus in other species of pollen- 
bearing plants may be routinely ascertained by known 
hybridization and DNA polymerase extension techniques. 

25 It is believed that the similarity of plant genes from 
species to species will allow for the embodiments of the 
present invention to be carried out using the DNA 
sequence in any number of pollen bearing plant species 
that are capable of being genetically transformed. The 

3 0 universality of plant genes has been widely documented in 
the literature and homologous plant genes have been 
described for plant actins (Shah, D.M. , et al . , J. Mol . 
Appl. Genet. 2:111-126, 1983), phytochrome (Hershey, 
H.P,, et al., Proc. Natl, Acad. Sci . USA 81:2332-2337, 

35 1984) storage proteins (Singh, N,K., ec al-. Plant Mol. 

Biol. 11:633-639, 1988) enzymes such as glutamine 
synthase (Lighcfoot, D.A. , ec al . . Plant Mol. Biol. 
11:191-202, 1988, and references within) and nitrate 
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reductase (Cheng, C. , et al , , EMBO Jour. 7:3309-3314). 
These and other examples in the literature clearly 
demonstrate that many plant genes are highly conseirved. 
It is also clear that this conservation applies not only 
5 to structural proteins but to enzymatic proteins 
important to cellular physiology. Therefore, it is 
believed that the DNA sequences, when found in another 
plant species, will be essential to microspore 
development and will be useful in carrying out the 

10 present invention in such species. 

It has also he^.n demonstrated that anti-sense RNA 
sequences derived from one plant species can effectively 
inhibit the expression of homologous DNA sequences in a 
different species. Van der Krol, et al . , 1988, Nature 

15 333:866-869. Therefore, it is expected that anti-sense 
RNA derived all or in part from Brassica microspore 
specific DNA sequences will be functional in other 
plants. Furthermore, the identification of the 

homologous gene in other plants by the hybridization 

20 and/or DNA polymerase extension techniques mentioned 
above will allow a person trained in the art to, if 
necessary, design an anti-sense gene which will more 
precisely complement the targeted gene and be more 
effective in inactivating such gene. 

25 As hereinbefore mentioned the targeted gene may also 

be essential for the development and/or function of all 
metabolically competent cells/tissues such as, but not 
limited to, genes involved in essential cellular 
structures, essential biosynthesis and essential 

30 metabolism. Examples of such sense genes may be found in 
the literature and include the genes which encode actin, 
tubulin or ubiquitin; three proteins which are essential 
to cellular growth and development. 

Sequences for actin genes isolated from plants have 

35 been published (for example; Baird W.V,, and Meagher, 
R.B., EMBO J. 6:3223-3231, 1987. or Shah, D.M., 
Hightower, R.C. and Meagher, R.B., Proc Nacl Acad Sci USA 
79: 1022-1026, 1982) and actin Is known to play a 
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critical role in normal cellular function especially 
during mitosis and meiosis where actin forms part of the 
cellular apparatus for cellular division. 

The sequence for plant tubulin has also been 
described (Raha, D., Sen, K. and Biswas, B.B, Plant Mol 
Biol 9:565-571, 1987). Tubulin, like actin, is known to 
be important in the cellular life -cycle particularly in 
regards co cell shape, transport and spindle formation 
during mitosis and meiosis. 

The DNA sequence for plant ubiquitin has also been 
published (Causing, K. and Barka--do,ttir / R. Eur J. 
Biochem 158:57-62, 1986). Ubiquitin is a protein 
involved in the turnover of cellular proteins and as such 
has a critical role in the regulation of specific 
cellular protein levels. In addition, ubiquitin is one 
of the most highly conserved proteins in eukaryotic 
cells. Interference with ubiquitin expression can cause 
abnormalities in the turnover of cellular proteins. 

If any of the aforementioned proteins are either not 
present, or are present but not in a normal functional 
form in the cell, proper cellular function is interfered 
with and the cell fails to develop properly. 

It is believed that a gene that is found to be 
essential for the continued development or function of 
all metabolically competent cells in one plant species 
will have a similar counterpart in other plant species, 
since it is generally understood that within the plant 
kingdom there are genes that are nearly identical or very 
homologous involved in the basic processes that control 
or are a result of cellular development. It is further 
believed that a gene which encodes a gene product which 
interferes with the expression of the gene (i.e., an 
anti-sense gene) in one plant species will have the 
ability to do so in other plant species. 

The tissue-specific and developmental ly regulated 
expression of a wheat endosperm protein synthesized in 
tobacco plants genetically transformed with this wheat 
gene has been reported (Flavell, R.B. , ec al . , Second 
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International Congress for Plant Molecular Biology, 
Abstract #97). In that example, the wheat . gene 
functioned in the tobacco plant in an identical fashion 
to the way in which it functions in a wheat plant . Other 
literature clearly shows that the regulation of a 
specific gene, which can be in many cases complex, is 
maintained in transgenic plants. One example of this is 
the phytochrome mediated regulation of a wheat 
Chlorophyll a/b-binding protein in transgenic tobacco 
(Nagy, F. etal., EMBO Jour. 5:1119-1124, 1986). In this 
example the light responsive speciTic regulation of 'the 
wheat gene was maintained in the foreign genetic 
environment. Not only do cereal genes function in a 
conserved manner, but genes from other plant species that 
are more closely related maintain functionality in 
heterologous genetic systems. Pea seed proteins are 
expressed properly in tobacco plants (Higgins, T.J.V., et 
al.. Plant Mol . Biol. 11:683-696, 1988), as are soybean 
seed proteins, (Barker, S.J., et al . , Proc. Natl. Acad. 
Sci.USA 85:458-462, 1988) and pea rbcS genes (Nagy, F. et 
al., EMBO Jour. 4:3063-3068, 1985). The scientific 
literature has numerous other examples of genes that have 
been used to genetically transform plants and those genes 
maintain their ability to function properly in this new 
genetic environment. Therefore the conserved nature of 
these genes, not only in the DNA sequences which control 
the expression of these genes, but the actual protein 
structure coded for by these genes, is similar among the 
plant species . 

As hereinbefore mentioned, the targeted gene may be 
.a gene which confers on cells of a plant resistance to a 
chemical agent or a naturally occurring or artificially 
incurred physiological stress. Such a gene may be native 
to a plant cell or may be a foreign gene, for example, a 
gene from another plant species. If the gene is a 
foreign gene it may be introduced inco the genome of a 
plant cell prior to. after, or concurrently with the 
anti-sense gene as will be described below. Examples of 
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such genes may be identified in the literature and 
include genes which confer resistance to a herbicide such 
as acetolactate synthase which confers resistance to 
chlorosulfuron; genes which confer resistance to a 
physiological stress such as the gene encoding superoxide 
dismutase which actively reduces oxygen radicals; genes 
which confer resistance to uv light and wounding (phenyl 
and alanine ammonia lyase or chalcone synthase) ; genes 
which confer resistance to disease or pests (e.g., 01,2 
glucanases, chitinase or proteinase) ; genes which confer 
resistance to antibiotics and certain toxic drugs such as 
the gene encoding neomycin phosphotransferase II 
conferring resistance to kanamycin. 

The recombinant DNA molecule containing an ant i- sense 
gene of the present invention additionally contains a 
promoter which regulates the expression of the anti-sense 
gene. If more than one DNA sequence containing an anti- 
sense gene (or one or more of a gene encoding a cytotoxic 
substance or a. gene which confers susceptibility to a 
chemical agent or physiological stress) is present in the 
recombinant DNA molecule of the invention, the DNA 
sequences may be regulated by an identical promoter or 
each may be regulated by a different promoter. 

If the targeted gene is a gene which is only 
expressed in cells/tissues of a plant that are essential 
to pollen formation and/or function, then a promoter that 
functions in all, many, or a variety of cell types 
including cells/tissues essential to pollen formation 
and/or function may be used in the construction of an 
anti-sense gene. An example of such a constitutive 
promoter is CaMV 35S or preferably HP 101 which has been 
isolated from Brassica napus as described below. Thus, 
the promoter that is used may be constitutively active in 
all or many cell types but the anti-sense gene whose 
expression is regulated by such a promoter will only 
interfere with the expression of the targeted gene in 
those cells/tissues producing the targeted gene in 
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question, i.e., cells/tissues essential to pollen 
formation and/or function. 

It is also possible to use an inducible promoter to 
regulate the expression of the ant i -sense gene. An 
5 inducible promoter is a promoter that is capable of 
directly or indirectly activating transcription of one or 
more DNA sequences or genes in response to an inducer. 
In the absence of an inducer the DNA sequences or genes 
will not be transcribed. Typically, the protein factor 

10 that binds specifically to an inducible promoter to 
activate transcription is present in an inactive form 
which is then directly or indirectly converted to the 
active form by the inducer. The inducer may be a 
chemical agent such as a protein, metabolite (sugar, 

15 alcohol etc.), a growth regulator, herbicide, or a 
phenolic compound or a physiological stress such as heat, 
salt, toxic elements etc. or the action of a pathogen or 
disease agent such as a virus. A plant cell containing 
an inducible promoter may be exposed to an inducer by 

2 0 externally applying the inducer to the cell such as by 

spraying, watering, heating, or similar methods. 
Examples of inducible promoters include the inducible 70 
KD heat shock promoter of D. melanogaster (Freeling, M., 
Bennet, D.C., Maize ADN 1, Ann. Rev, of Genetics 19:297- 
25 323) and the alcohol dehydrogenase promoter which is 
induced by ethanol (Nagao, R.T., et al., Miflin, B.J., 
Ed. Oxford Surveys of Plant Molecular and Cell Biology, 
Vol. 3., p. 384-438, Oxford University Press, Oxford 
1986) . The inducible promoter may be in an induced state 

3 0 throughout pollen formation or at least for a period 

which corresponds to the transcription of the sense gene - 
A promoter that is inducible by a simple chemical is 
particularly useful since the male sterile plant can 
easily be maintained by self-pollination when grown in 
35 the absence of such a chemical. 

Alternatively, if the targeted gene is a gene 
essential for the development and/or function of all 
metabolically competent cells/ tissues or a gene which 
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confers on cells of a plant resistance to a chemical 
agent or physiological stress, a promoter that is active 
only in cells/tissues essential to pollen formation 
and/or function (for example, a promoter essential for 
pollen development and/or function) is preferably used to 
regulate the expression of the anti-sense gene. 

The invention also relates to a promoter essential 
for pollen development and/or function or functional 
portions of the promoter thereof and chimeric promoters 
constructed using the promoter essential for pollen 
development and/or function and/or portions thereof. 

A promoter essential for pollen development 
and/or function is a DNA sequence which regulates the 
expression of a DNA sequence selectively in the 
cells/tissues of a plant essential to pollen formation 
and/or function and/or limits the expression of such a 
DNA sequence to the period of pollen formation in the 
plant . 

Preferably, the promoter essential for pollen 
development and/or function is a DNA sequence 
corresponding to the promoter sequence in the microspore 
specific genes identified as L4 , LIO, LIS and L19 herein 
or a functional fragment thereof; or a chimeric promoter 
sequence containing one or more of a promoter sequence 
from the microspore specific genes identified as L4 , LIO, 
L16 and L19 or portions of such promoter sequences . The 
preferred promoter essential for pollen development 
and/or function may be used in conjunction with naturally 
occurring flanking coding or transcribed sequences 
described herein or with any other coding or transcribed 
sequence that is essential to pollen formation and/or 
function. 

The promoters essential for pollen development and/or 
function referred to above were isolated from a plant of 
the species Brassica napus . It is believed that the it 
is possible to use these promoters to limit the 
expression of a given DNA sequence to pollen formation 
and/or function and to a specific period during pollen 
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formation and/or development from a plant of a different 
species. The published scientific literature has clearly 
shown that plant genes are universal and the plant 
tissue-specific promoter fragments retain their function 
in other species. For example, wheat endosperm promoter 
fragments function to give appropriate seed specific 
expression in tobacco (Simpson, J. ec al . . EMBO Jour. 
4:2723-2729, 1985) and the alcohol dehydrogenase promoter 
(Adh-l) from corn (^ea mays) can be used in conjunction 
with other promoter fragments to give appropriate 
expression in tobacco (Ellis, J. G., et al . , 1987, EMBO 
J. 6: 11-16). Additionally the maize transposable 
element Ac is active in tobacco and other plant species 
(Taylor, et al . , 1989, Plant Mol . Biol. 13: 109-118) 
providing further evidence of the universality of plant 
gene structure and function. These examples demonstrate 
the equivalent tissue specific function of the same 
promoters in very widely divergent species (monocots to 
dicots) . Research studies have shown equivalent promoter 
function in more closely related species within the same 
family or between families such as the Solanacea^. and 
Brassicaceae . In a preferred application, the promoter 
isolated from the anther-specific LIO gene discussed 
above is the promoter employed to regulate expression of 
the ant i -sense gene. 

In a further preferred application, the anther- 
specific TA39 promoter from tobacco is employed to 
regulate expression of the anti-sense gene. The anther- 
specific TAB 9 promoter is from a genomic clone from a 
commercially available genomic library from Nicotiana 
tabacum L. var NK 326 (Clontech) . The library was probed 
with the anther-specific cDNA TA3 9 obtained from Dr. 
Robert B. Goldberg of UCLA. The isolation of TA39 and its 
expression characteristics have been reported (Koltunow, 
A. M., Truettner, J. . Cox. K. H. , Wallroth, M. and 
Goldberg R. B. Different temporal and spatial gene 
expression patterns occur during anther development. 
Plant Cell. 2:1201-1224, and Goldberg, R. B.. Beals. T. 
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P., and Sanders, P. 1993. Anther Development: Basic 
Principles and Practical Applications. Plant Cell 5: 
1217-1229 (1990) . 

It is appreciated, however, that refinements in 
promoter function may be required for individual plants 
or species to maximize or modulate the appropriate timing 
or level of expression to carry out aspects of the 
invention. Accordingly, methods for the modification of 
promoters to modify or improve function in various plants 
of different origin are provided herein. 

It is noted that there is not a high degree of DNA 
sequence homology between the native promoter essential 
for pollen development and/or function from the L4 , LIO, 
L16 and L19 clones. Test data reveal that the timing and 
level of expression of these genes is not identical in 
pollen, but that all overlap in activity at some time. 

It is expected that one may use any number of 
different promoters essential for pollen development 
and/or function to regulate the expression of a DNA 
sequence in a recombinant DNA molecule of the invention. 
In order to determine what promoter essential for pollen 
development and/or function may be used to regulate the 
expression of an anti-sense gene (or other sequences 
which will be discussed below in regard to other 
embodiments of the invention) consideration is to be 
given to a number of factors. 

The promoter essential for pollen development and/or 
function used should be a promoter (or a modified form of 
the promoter) that is active at the appropriate time to 
produce sufficient levels of transcribed RNA to carry out 
the invention. The promoter essential for pollen 
development and/or function derived from pollen specific 
clones disclosed herein (namely those derived from the 
microspore genes identified as L4 , LIO, L16 and L19) that 
are active early in the development of microspores such 
that gene expression takes place both during and after 
the meiotic and mitotic division of pollen mother cells. 
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Thus, the activity of these promoters is not limited by 
segregation. 

When using a promoter essential for pollen 
development and/or function to inactivate a gene that is 
essential to pollen formation and/or function, as 
hereinbefore discussed, it may be difficult to predict, 
a priori , which promoter essential for pollen development 
and/or function or modified promoter construct will 
effectively block the function of such a gene. It is 
preferable to use a promoter essential for pollen 
development and/or function that displays a similar 
developmental pattern to the gene. A convenient method 
to determine when the gene targeted for interference is 
expressed is to isolate RNA from developing microspores 
at different stages and to analyze this RNA for the 
expression of the gene by the so-called Northern blot 
analysis. This procedure will allow for the 

determination of the developmental period in which the 
gene is expressed. in order to determine the 

developmental period in which a promoter essential for 
pollen development and/or function gene is expressed, a 
similar series of analyses can be carried out using as a 
probe a reporter gene such as beta -glucuronidase joined 
to the promoter essential for pollen development and/or 
function, or a native gene from the same plant species 
from which a promoter essential for pollen development 
and/or function is isolated and whose expression is 
regulated by the promoter essential for pollen 
development and/or function. When the promoter essential 
for pollen development and/or function is isolated from 
one plant and used in a different plant species the 
preferred method is the use of a reporter gene joined to 
the promoter to determine the exact developmental timing 
that the promoter has in that particular plant species. 

It is understood that the activity of a promoter 
essential for pollen development and/or function, whether 
intended for the same or different species may be 
modified in structure to change or alter activity in a 
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plant . Changes that are contemplated include but are not 
necessarily limited to: addition or deletion of 
sequences, orientation of upstream or downstream 
sequences, and the inclusion of introns or parts of the 
coding sequence of the pollen specific gene. The above 
modification may serve to increase expression or improve 
regulation of expression to targeted stages of 
development , 

It should be noted that the identification of a 
promoter region (including constitutive, inducible and 
promoter essential for pollen development and/or 
function) is usually defined by function rather than a 
set DNA sequence. Two hundred (200) nucleotide bases or 
less of a promoter sequence may be sufficient to maintain 
promoter function. It should also be recognized that 
some upstream DNA sequences can be arranged in opposite 
orientations and still retain or demonstrate enhanced 
promoter function. In addition, "enhancer-like" DNA 
sequences, which are usually small conserved DNA 
sequences ranging in size from less than 10 nucleotides 
to considerably larger numbers of nucleotides may also be 
inserted into promoter regions to enhance expression. 

It may also be desirable to include some intron 
sequences in the promoter constructs since the inclusion 
of intron sequences in the coding region may result in 
enhanced expression. Thus, it may be advantageous to 
join the DNA sequences to be expressed to a promoter 
sequence that contains the first intron and exon 
sequences of a polypeptide which is unique to 
cells/tissues of a plant essential for pollen formation 
and/or function. 

Additionally regions of one promoter may be joined 
to regions from a different promoter in order to obtain 
the desired promoter activity. Specific examples of 
chimeric promoter constructs are the chimeric promoters 
contained in the vectors PAL1107 and PAL,1106. 

It IS desiraible that the promoter essential for 
pollen development and/or function (or constitutive or 
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inducible promoter whichever is selected to construct a 
recombinant DNA molecule of the invention) function so 
that sufficient levels of anti-sense RNA are provided to 
substantially interfere with the expression of the 
targeted gene. Investigations of the mechanism of anti- 
sense RNA inhibition of gene expression in model systems 
have suggested that equal or greater than equal levels of 
anti- sense RNA may be required in order to observe a 
significant reduction of the targeted gene activity. 
However, in some cases it is noted that low levels of 
anti-sense RNA can nave a specific reduction in sense 
gene activity. Therefore, in some instances if the gene 
that is targeted for inactivation by anti -sense RNA is a 
gene that is found to be essential for the continued 
development or function of all metabolically competent 
cells, or a gene that is expressed in all cell types at 
a low level, an excess of anti -sense RNA may not be 
required for inhibition. Additionally less than total 
reduction of the gene activity may be more .than 
sufficient to disrxipt pollen development which is known 
to be very sensitive to many stressful conditions. 
Therefore, it is suggested that the promoter essential 
for pollen development and/or function that is used, to 
carry out certain aspects of this invention, be chosen 
based on the observation that the promoter essential for 
pollen development and/or function functions to cause the 
expression of any sequences adjacent to it to be 
transcribed at a time that parallels or overlaps the 
period of time that the gene sought to be inactivated is 
expressed and that the levels of anti -sense RNA expressed 
from the anti-sense gene be of levels sufficient to 
inhibit the targeted gene's expression, usually to mean 
greater than or equal to the levels of sense RNA. 

By using a promoter essential for pollen development 
and/or function to regulate the expression of an anti- 
sense gene, it is possible to interfere with for example, 
normal microspore development in any given plant, without 
having first to isolate from the genomic DNA of the plane 
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a gene which is essential to microspore development. 
Thus, a male sterile plant may be produced where the. gene 
targeted for interference is a gene that is essential for 
the development and/or function of all metabolically 
competent cells/tissues. To produce a male sterile 
plant, such a gene is specifically interfered with by 
using a recombinant DNA molecule containing a promoter 
essential for pollen development and/or function to 
regulate the transcription of the anti-sense gene. 

Furthermore, a male sterile plant may be produced by 
growing a plant which has a recorbinant DNA molecule 
comprising an anti-sense gene to a gene which confers on 
cells of a plant resistance to a chemical agent or a 
naturally occurring or artificially induced physiological 
stress and a promoter essential for pollen development 
and/or function. The transcription product of the anti- 
sense gene will interfere with the expression of the 
targeted gene in cells/tissues essential to pollen 
formation and/or function. The gene confers on the 
remainder of the plant resistance to the stress. 

In order to interfere with the expression of a gene, 
it is preferred that the anti- sense gene and targeted 
gene be expressed at about the same time. The co- 
incident expression may be achieved in a variety of ways 
using combinations of constitutive, inducible and organ 
specific promoters (for example, a promoter essential for 
pollen development and/or function as hereinbefore 
described) . -However, co- incident expression may be 
readily achieved by regulating the expression of the 
anti -sense gene with the same promoter that controls the 
targeted gene, thereby causing both to be transcribed in 
the same time frame. The concept of regulating gene 
expression using anti -sense genes is described in 
Weintraub, H. et al., Antisense RNA as a molecular tool 
for genetic analysi-q- Reviews — Trends in Genetics, Vol. 
1(1) 1986. 

It is preferable chat the recombinant DNA molecule 
containing an anti-sense gene of the invention further 
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contain one or more selection marker genes which encode 
a selection gene product which confers on cells/tissues 
of a plant resistance to a chemical agent or 
physiological stress, such that plant cells transformed 
5 with the recombinant DNA molecule or plants containing 
such transformed plant cells may be easily selected using 
a selective agent. Any effective selective agent for 
which a resistance gene has been identified may be used 
to select transformed plant cells. The selective agent 

iO used to select plant cells transformed with the 
recombinant DNA of the invention ccruld be said to fall 
within two broad non-mutually exclusive categories, a 
chemical agent and a physiological stress. Examples of 
selective agents which may be used to select plant cells 

15 are herbicides, antibiotics, toxic substances and plant 
diseases, in particular glufosinate, glyphosate, 
tabtoxinine-S-lactam (TBL) , chlorsulf uron, kanamycin, 
hygromycin, methotrexate, bleomycin, and phosphinotricin. 
Preferred selection marker genes which may be used in the 

20 present invention to facilitate selection of transformed 
cells include the gene encoding the enzyme hygromycin 
phosphotransferase conferring resistance to hygromycin 
and the enzyme neomycin phosphotransferase (NPT II) which 
confers resistance to kanamycin. The most preferred 

25 resistance markers include resistance genes to herbicide 
agents such as glufosinate (commercial names are Basta, 
Liberty) glyphosate (commercial name is Roundup) and TBL, 
Examples of selective agents which may be used to 
select male sterile plants produced using a recombinant 

30 DNA molecule of the invention include a chemical agent, 
such as a herbicide, which could be used on a commercial 
scale and therefore can provide an economic benefit 
during the plant growing season additional and separate 
from the use of herbicide as a selective agent of a plant 

35 transformed by recombinant DNA. Resistance genes include 
any genes that allow the transformed cell to survive 
treatment with the selective agent. That includes, but 
is not limited to. a detoxification agenr , interference 
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with transport of the selective agent, or over-production 
of the molecule targeted by the selective agent. 
Examples of herbicides for which a resistance gene has 
been identified which may be useful as selective agents 
for plants are glyphosate whose commercial name is 
Roundup (described in Comai, L., Facciotti, D. , Hiatt, 
W.R., Thompson, G. , Rose, R. E., Stalker, D. M., 1985, 
Nature, Vol. 317, Pages 741-744), chlorsulf uron 
(described in Haughn, G. W., and Somerville. C. R. , 1986, 
Mol. Gen. Genet., Vol. 210, Pages 430-434) and ^ 
phosphinotricin (Murakami T, et al . , Mol. Gen. Genet. 
205:42-50, 1986). Another herbicide for which a 
resistance gene or a fragment of DNA containing 
resistance gene(s) were reported is TBL (Kinscherf et 
al., 1991, J. Bact.i73: 4124-32, and from yeast, Marek, 
E.T. and Dickson, R.C., 1987, J. Bact . 169: 2440-8). 

Genes encoding proteins or polypeptides that can 
provide a distinctive phenotypic characteristic to plant 
cells such as a gene that encodes a protein providing a 
distinguishable color or morphology to plant cells and 
genes which encode proteins which confer stress tolerance 
such as the metallothiones gene may also be used as 
selection marker genes to facilitate the selection of 
transformed plants . 

For a general reference on selective agents see 
Weising, K. , Schell, J., and Kahl, G. Ann. Rev. of 
Genetics, 22:421-477 (1988). 

The selection marker gene may be expressed in 
particular cells or constitutively in the entire plant 
depending on the nature of the selection gene product. 
The expression of the selection marker gene(s) is 
regulated by one or more selection marker promoters. The 
selection gene marker promoters may be isolated using 
techniques known in the art and include constitutive 
promoters such as CaMV 35S. 

To facilitate selection of plant cells transformed 
with a recombinant DNA molecule or male sterile plants 
containing a recombinant DNA molecule, it is preferred 
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that a selection marker gene be contained in the 
recombinant DNA molecule, most preferably linked to the 
DNA sequence contained in the recombinant DNA molecule. 
Where the recombinant DNA molecule contains more than one 
5 DNA sequence it is preferred that each DNA sequence be 
linked to a different selection marker gene. It is 
contemplated in some embodiments of the invention that at 
least two recombinant DNA molecules be used to produce a 
male sterile plant and in such embodiments it is 
10 preferable that each recombinant DNA molecule contain a 
selection marker gene preferably linked- to the. DNA 
sequence . 

In a hybrid seed production scheme where there are 
alternating rows of male sterile plants and male fertile 

15 plants, it may be advantageous to carry out the final 
selection of male steriles in the field alongside the 
male fertile donors. Therefore it is desirable if the 
suitable male fertile donors are previously transformed 
to resistance to the selective agent to avoid having to 

20 selectively apply the selective agent to the rows of male 
sterile plants. 

It is to be understood that a plant sought to be 
rendered male sterile need not be transformed with a 
selection marker gene that confers resistance to a 

25 chemical agent or a naturally occurring or artificially 
induced physiological or chemical stress if this gene is 
native to the plant. 

The recombinant DNA molecule containing an anti-sense 
gene of the present invention preferably also contains a 

30 termination sequence and/or other transcription 
regulation signals.. Examples of termination sequences 
which may be used in the recombinant DNA molecules of the 
invention are the nos terminator found in pRAJ-221 
(Clonetech Laboratories. Palo, Alto, CA) . 

35 A detailed description of the isolation and 

identification of genes which are only expressed in 
cells/tissues of a plant essential to pollen formation 
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and/or function and promoters essential for pollen 
development and/or function is set out below.: 

To isolate genes which are only expressed in 
cells/tissues of a plant that are essential to pollen 
formation and/or function and promoters essential for 
pollen development and/or function, a genomic library of 
plant DNA may be constructed from DNA isolated from fresh 
young leaves according to standard methodology ( Molecula-r 
Cloning, a Laboratory Manual Maniatis, T., Fritsch, E. 
F,, and Sambrooks, J., Cold Spring Harbour Laboratory, 
Cold Spring Harbour, New York, 1982) and screened with 
probes derived from several tissues, one of which is made 
from RNA specific to cells/tissues of a plant that are 
essential to pollen formation and/or function 
(hereinafter referred to as pollen specific RNA) . The 
other probes should be made from RNA from different 
tissues so as to represent genes expressed in tissues of 
the plant that would not be expected to include genes 
that are expressed in cells/tissues of a plant that are 
essential to pollen formation and/or function. Examples 
include but are not limited to such tissues as leaf, 
roots, seeds, stigma, stem and other organs. Some genes 
are expressed in all tissues • and some only in a 
restricted number of tissues, by comparison of many plant 
tissues it is possible to isolate genes expressed 
exclusively in cells/tissues of a plant that are 
essential to pollen formation and/or function. 

Pollen specific RNA may be isolated from 
cells/tissues of a plant that are essential to pollen 
formation and/or function that are at the early to late 
uninucleate stage. Although it is possible to isolate 
pollen specific RNA at other stages, isolation of the 
pollen specific RNA may be technically difficult at 
earlier developmental stages,- older cells may have 
limited nuclear gene activity; and, promoters may not be 
suitable for use as described in the present invention. 
For a detailed description of the isolation of pollen 
specific RNA see Example lA herein. 
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Pollen specific RNA may be labelled for the purpose 
of detection. It is convenient to make radioactive cDNA 
by using the pollen specific RNA and AMV reverse 
transcriptase in the presence of random hexanucleotide 
5 primers and alpha- [^^P] -dCTP . Probes are used for 
hybridization to nitrocellulose plaque, lifts of plates 
containing the clones of the genomic library. Clones 
that can be identified as strongly hybridizing only to 
pollen specific cDNA and not cDNA from any other tissue 

10 examined are chosen. These clones are plaque purified 
and grown for DNA isolation. Alternative techniques for 
manipulation of DNA and RNA as well as recombinant DNA, 
growing and isolating clones can be found in standard 
laboratory manuals, such as Molecular Cloning. A 

15 Laborat ory Manual (Maniatis, T. , Fritsch, E. F., and 
Sambrook , J., Cold Spring Harbour Laboratory, New York, 
1982) . 

For applications where the genomic DNA sequence of 
L4 , LID, L16 or L19 from Bra^ssica. napus are used to carry 

2 0 out certain aspects of this invention, the preferred 

method to obtain a gene that is essential to pollen 
formation and/or function is to synthetically produce a 
homologous DNA sequence according to standard methodology 
(see Gait, M. J.., Ed., (1984} Oligonucleotide synthesis, 
25 . a practical approach, pp 1-22, IRL Press, Oxford, U.K.), 
label the sequence for the purpose of detection and use 
the labelled sequence to screen a Brasslca napus genomic 
library produced according to the methods described. 

The identity of the promoter and coding region of a 

3 0 given genomic clone is determined by restriction mapping 

and hybridization analysis. This may be accomplished by 
hybridization of cDNA probes made from pollen specific 
RNA with restriction fragments of the DNA clones 
immobilized on nitrocellulose. Restriction endonuclease 
3 5 fragments which contain both the coding region and 
regions of DNA on either side of the coding region are 
isolated by sub-cloning in appropriate vectors. Once 
isolated, it is convenient to use techniques such as SI 
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mapping and DNA sequencing to obtain exact coding regions 
and restriction sites within the sub-cloned DNA. . This 
analysis is easily accomplished once the polarity with 
respect to gene transcription is known. 

In order to determine the polarity of transcription 
of a gene, individual restriction fragments may be sub- 
cloned in commercially available vectors such as pGEM3 , 
pGEM4,or PGEM3Z, pGEM4Z (available from Promega Biotech, 
Madison, Wisconsin, USA) . By using these vectors one is 
able to generate single stranded RNA probes which are 
complementary to one or the other strands of the DNA* 
duplex in a given sub-clone. These strand specific 
probes are hybridized co mRNA, in order to establish, the 
polarity of transcription. Among these probes, one can 
isolate those probes which hybridize with and hence are 
complementary to the mRNA. Using this information it is 
possible to clearly determine from what DNA strand of the 
double-stranded genomic DNA molecule the mRNA has been 
transcribed. 

In order to delineate and isolate the promoter DNA 
sequences of a given genomic clone the pGEM series of 
vectors can be used for the unidirectional deletion of 
sequences from the individual sub- clones in 
hybridization-protection experiments . Detailed 
descriptions of these experimental procedures can be 
found in a number of laboratory handbooks and in the 
manufacturers technical notes supplied with the pGEM 
series of vectors. These experiments will clearly 
establish the promoter and coding regions of the pollen 
specific genomic, clones . 

The sequence of individual deletions in the pGEM 
vectors can be determined by dideoxy sequencing of 
plasmid mini-preps as described in the manufacturer's 
technical notes. Deletion sub-clones that are deleted to 
very near the start of transcription or specific 
resnriccion fragments chac encompass the promoter region 
or the promocer region and the scare of cranscription are 
chosen for the construction of genes that are expressed 
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only in developing microspores of pollen bearing plants. 
Usually the promoter fragment is inserted upstream of a 
terminator such as the nos terminator found in pRAJ-221 
(available from Clonetech Laboratories, Palo Alto, CA) 
and specific restriction fragments which are to be 
transcribed into anti-sense RNA are inserted between the 
promoter and terminator sequences. The entire construct 
is verified by combination of sequencing and restriction 
digests. The anti -sense gene thus constructed and 
verified may be inserted in T-DNA based vectors for plant 
cell transformation. T-DNA vectors that contain a 
selectable marker are preferred. It is to be understood 
that the anti-sense gene can be constructed in a variety 
of ways depending on the choice of vectors, restriction 
enzymes and individual genes used. For example, it may 
be convenient to insert restriction fragments intended to 
be transcribed into anti -sense RNA into a T-DNA based 
vector to which a promoter and terminator structure have 
been previously added. Alternatively, it is possible to 
insert a promoter fragment upstream of a coding region 
and terminator that has been previously added to a T-DNA 
based vector. In addition, it may be desirable in some 
crops not to insert the anti-sense gene into a T-DNA 
based vector but rather into a vector suitable for direct 
DNA uptake. Promoters other than promoters essential for 
pollen development and/or function can be used and joined 
with specific restriction fragments of genes and 
terminators provided that these promoters function in 
cells and/or tissues essential to pollen formation and/or 
function. 

In accordance with a second embodiment of the 
invention, a method is provided for the production of a 
male sterile plant by transforming a plant with a 
recombinant DNA molecule comprising a promoter essential 
for pollen development and/or function as hereinbefore 
described, and a DNA sequence which encodes a gene 
product which renders a cell/tissue essential to pollen 
formation and/or function susceptible to a chemical agent 
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or physiological stress . Such a recombinant DNA molecule 
can be introduced into a plant wherein all the cells of 
the plant in which the recombinant DNA molecule is not 
expressed are resistant to the chemical agent and/or 
physiological stress. 

In accordance with a third embodiment of the 
invention a method is provided for the production of a 
male sterile plant by transforming a plant with a 
recombinant DNA molecule comprising a promoter essential 
for pollen development and/or function as hereinbefore, 
described and a DNA sequence which encod ^s a protein or 
polypeptide which is cytotoxic to a cell which is 
essential to pollen formation and/or function. By 
substance which is cytotoxic to a cell is meant a 
substance that when active disturbs the normal function 
of a plant cell/tissue preferably leading to cell death. 
Any cytotoxic substance which is known to be encoded by 
one or more identifiable DNA sequences may be employed 
within the scope of this embodiment of the invention, 
including but not limited to ricin, abrin and diphtheria 
toxin. Thus, the DNA sequence may encode any substance 
which is cytotoxic to a cell that is essential to pollen 
formation and/or function including the genes encoding 
ricin, abrin and diphtheria toxin. 

In addition to DNA sequences encoding cellular toxins 
such as ricin, abrin and diphtheria toxin, the 
recombinant DNA molecule of the present invention may 
encode other cytotoxic substances including degrading or 
destructive enzymes such as ribonuclease, DNAse, 
ribozymes. lipase, or protease, substances that disrupt 
or de-stabilize cytoplasmic integrity such as polylysine 
or poly-proline, substances that disrupt or de-stabilize 
proteins or polypeptides which are essential to certain 
biosynthetic pathways in plant cells or interfere with 
the expression of such proteins or polypeptides. 

In accordance with a fourth embodiment of the 
invention, the recombinant DNA molecule may comprise one 
or more DNA sequences which encode a gene product which 
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a 



renders a non-toxic substance cytotoxic ho a cell/tissue 
which is essential to pollen formation and/or function. 
In particular, any identifiable DNA sequences encoding 
gene products which can convert a non-toxic to 
cytotoxic substance can be employed within the scope of 
the invention. Examples of such DNA sequences include 
the following: 

a) DNA sequences which encode indole acetamide 
hydrolase (lamH) which converts naphthalene acetamide to 
the plant growth regulator alpha naphthalene acetic acid 
(NAA) which is toxic to developing pollen grains, or 
converts indole acetamide to indole acetic acid (lAA) 
which is a plant growth regulator. One source of the 
enzyme laMH is the bacterium Aarobactf^-r-i nm tumefaciens 
(Inze, D., etal., 1984, Mol . Gen. Genet. 194:265-74), 

b) DNA sequences which encode the enzyme methoxinine 
dehydrogenase (MDH) which converts non- toxic 2-amino-4- 
methoxy-butanoic acid (methoxinine) to toxic methoxyvinyl 
glycine. One source of MDH is the bacterium Pseudomonas 
aeruginosa (Margraff, R. , et al . , 1980, Experimentia 36: 
486) . 

c) DNA sequences which encode the enzyme 
rhizobitoxine synthase which converts 2-amino-4-methoxy- 
butanoic acid to rhizobitoxine (2 -amino-4 - [2 -amino-3 - 
hydroxypropyl] - trans -3 -butanoic acid) . One source of the 
enzyme is the bacterium Rhizobium iaponicum (Owens, L.D. , 
et al., 1973, Weed Science 21: 63-66). 

d) DNA sequences which encode the tissue specific 
beta-glucuronidase enzyme which for example converts a 
non- toxic analog of glucuronic acid conjugated to a toxic 
substance such as chloramphenicol (Gluc-Camp) or 
glyphosate (N- [phosphomethyl] glycine) to cytotoxic 
substances such as chloramphenicol and glyphosate. 
respectively. Other toxic substances may be conjugated 
to glucuronic acid to provide a non- toxic analog of 
glucuronic acid. 

Where the non- toxic substance does not occur 
naturally in the plant cell into which the recombinant 
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DNA molecule of the fourth embodiment of the invention is 
to be introduced, the non-toxic substance may be 
introduced into the plant cell by spraying, watering or 
other similar means. The non-toxic substance may be 
produced in the cell by providing in the recombinant DNA 
molecule having a DNA sequence which encodes a gene 
product which renders the non- toxic substance cytotoxic 
to the cell, or in a second recombinant DNA molecule, a 
second DNA sequence which encodes the non- toxic 
substance. The non-toxic substance may also be 

introduced into the cell by providing in the recombinant 
DNA molecule having a DNA sequence which encodes a gene 
product which renders the non-toxic substance cytotoxic 
to the cell, or in a second recombinant DNA molecule, a 
second DNA sequence which encodes a gene product which 
converts a substance which is endogenous to the cell to 
the non-toxic substance. For example, a cell may contain 
a recombinant DNA molecule having a DNA sequence which 
encodes lamH (which converts indole acetamide to 
cytotoxic levels of indole acetic acid) , and a 
recombinant DNA molecule having a DNA sequence which 
encodes lamS . lamS converts tryptophan which is 
generally endogenous to plant cells, to indole acetamide 
which in turn is converted by lamH to cytotoxic levels of 
indole acetic acid. In the examples above, where the 
introduction of more than one DNA sequence may be 
desired, the sequences can be introduced on separate 
recombinant DNA molecules or, in a preferred case, they 
can be linked together on one molecule. 

The recombinant DNA molecule comprising one or more 
DNA sequences which encode a gene product which renders 
a non-toxic substance cytotoxic to a cell/tissue which is 
essential to pollen formation and/or function 
additionally contains one or more promoters to regulate 
the expression of the DNA sequences. If there is more 
Chan one DNA sequence in the recombinant DNA molecule the 
expression of the DNA sequences may be regulated by an 
identical promoter or the expression of each DNA sequence 
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may be regulated by a different promoter. Preferably, 
the expression of the DNA sequences is regulated, by a 
promoter essential for pollen development and/or function 
as hereinbefore described so that the gene product which 
5 renders a non- toxic substance cytotoxic selectively 
interferes with the function and/or development of cells 
essential to pollen formation and/or function. Where the 
non-toxic substance is introduced into the plant cell by 
providing a second recombinant DNA molecule having a 

10 second DNA sequence encoding a second gene product which 
converts a substance which is endogenous to the cell to 
a non-toxic substance, the expression of the DNA sequence 
and the second DNA sequence may be regulated by either a 
promoter essential for pollen development and/or 

15 function, an inducible promoter or a constitutive 
promoter so long as there is selective interference with 
the function and/or development of cells essential to 
pollen formation and/or function. Preferably the 
expression of the DNA sequence encoding a gene product 

2 0 which renders a non- toxic substance cytotoxic is 
regulated by a promoter essential for pollen development 
and/or function and the expression of the second DNA 
sequence encoding a gene product which converts a 
substance which is endogenous to the cell to a non-toxic 

25 substance is regulated by a constitutive promoter, an 
inducible promoter or a promoter essential for pollen 
development and/or function, although most preferably by 
a promoter essential for pollen development and/or 
function. For a more detailed discussion of the above - 

30 mentioned promoters reference may be made to the 
discussion of promoters in reference to the recombinant 
DNA molecules containing anti-sense genes. A preferred 
promoter would be the microspore -specific BplO promoter 
(Albani, D., Sardana, R. , Robert, L.S., Altosaar, I., 

35 Arnison, P.G., and Fabijanski, S.F., A Brassicct napus 
gene family which shows sequence similarity to ascorbate 
oxidase is expressed in developing pollen. Molecular 
characterization and analysis of promoter activity in 



wo 96/40950 



- 44 - 



PCTAJS96/08692 



transgenic tobacco plants. Plant J". 2: 331-342 (1992) or 
the anther- specif ic TA3 9 promoter hereinbefore described. 

In accordance with a fifth embodiment of the 
invention, the recombinant DNA molecule comprises one or 
5 more DNA sequences which encode a gene product which may 
be converted into a substance which is cytotoxic to 
cells/tissues essential for pollen formation and/or 
function. 

The recombinant DNA molecule of the second, third, 
10 fourth and fifth embodiments of the invention also 
preferably further contain one or more selection marker 
genes as hereinbefore described and termination sequences 
and other transcription regulation signals as 
hereinbefore described. The preferred selection marker 
15 would be herbicide resistance gene(s) to glufosinate or 
glyphosate or TBL, described hereinbefore. 

The following description sets forth, in general 
terms, the steps that can be employed to produce plants 
which are male sterile or carry the male sterile trait, 
20 using the recombinant DNA molecules of the invention. It 
is to be understood that these various steps may be 
accomplished by a variety of different procedures. In 
the following description of procedures, alternative ways 
to accomplish these steps are disclosed. Preferred 
25 procedures may be detailed below and in the examples. 
However, it is contemplated that other variations will be 
apparent to those skilled in the art. 

The recombinant DNA molecules of the invention may 
be used to produce a plant which is male sterile or 
30 carries a male sterile trait, according to the following 
preferred steps : 

(1) Introducing one or more of the recombinant DNA 
molecules, as contemplated herein for carrying 
out the invention, into the genome of one or 
35 more plant cells, of a plant selected from 

those species of pollen producing planes which 
are capable of being genetically transformed. 
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preferably cells from which differentiated 
whole plants may be conveniently recovered; 

(2) Selecting a plant cell into which the 
recombinant DNA molecule (s) is stably 
incorporated; and 

(3) Regenerating from the selected plant cell a 
plant which is male sterile or carries the male 
sterile trait. 

A male sterile plant may be preferably produced by 
the above-mentioned steps by using the following 
recombinant DNA molecules of the inven^'ion in step (1) T 

a) A recombinant DNA molecule comprising a DNA 
sequence which encodes an RNA which when 
produced in a cell of a plant which is 
essential to pollen formation and/or function 
interferes with a gene which is essential to 
pollen formation and/or function and a promoter 
essential for pollen development and/or 
function or a constitutive promoter; 

b) A recombinant DNA molecule comprising a DNA 
sequence which encodes an RNA which when 
produced in a cell of a plant which is 
essential to pollen formation and/or function 
interferes with a gene which is essential to 
the continued development and/or function of 
all metabolically competent cells of a plant 
and a promoter essential for pollen development 
and/or function; and 

c) A recombinant DNA molecule comprising a DNA 
sequence which encodes a protein or polypeptide 
which when produced in a cell of a plant which 
is essential to pollen formation and/or 
function is substantially cytotoxic to the cell 
or renders a non-toxic substance cytotoxic to 
the cells, where the non- toxic substance is 
introduced mco the cell by converting a 
subscance endogenous to the cell cc che non- 
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toxic subsnance, and a promoter essential for 
pollen development and/or function, 
A plant carrying a male sterile trait may be 
preferably produced by the above mentioned steps by using 
the following recombinant DNA molecules of the invention 
in step (1) : 

(a) A recombinant DNA molecule comprising a DNA 
sequence which encodes an RNA which when 
produced in a cell of a plant which is 
essential to pollen formation and/or function 
interferes with a gene which is essential to 
pollen formation and/or function and an 
inducible promoter; 

(b) A recombinant DNA molecule comprising a DNA 
sequence which encodes a protein or polypeptide 
which renders a cell of a plant which is 
nominally resistant to a chemical agent or 
physiological stress sensitive to said chemical 
agent or physiological stress and a pollen- 
specific promoter; and 

(c) A recombinant DNA molecule comprising a DNA 
sequence which encodes a protein or polypeptide 
which when produced in a cell of a plant which 
is essential to pollen formation and/or 
function renders a non- toxic substance 
substantially cytotoxic to the cell, where the 
nontoxic substance is externally applied to the 
cell, or where the non- toxic substance is 
introduced into the cell by converting a 
substance endogenous to the cell of the non- 
toxic substance and an inducible promoter 
controls the expression of the protein or 
polypeptide which converts the endogenous 
substance to the non- toxic substance. 

A plant carrying a male sterile trait may be rendered 
male sterile by exposing the plane co a scerility 
actuacing agent which for example is an inducer in the 
case of (a) above, a chemxcal agent or physiological 
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Stress in the case of (b) above, or a non- toxic substance 
which is capable of being rendered cytotoxic (i.e., a 
cryptocytotoxic substance) to a cell which is essential 
to pollen formation and/or function in the case of (c) 
5 above , 

A recombinant DNA molecule may be introduced into a 
plant cell by any one of a variety of known methods 
preferably by first inserting the recombinant DNA 
molecule into a suitable vector and then using the vector 

10 to introduce the recombinant DNA molecule into a plant 
cell , As hereinbefore described a recombinant DNA 
molecule may additionally contain a selection marker gene 
which encodes a selection gene product which confers on 
a plant cell resistance to a chemical agent or 

15 physiological stress, or confers a distinguishable 
phenotypic characteristic to the cells such that plant 
cells transformed with the recombinant DNA molecule may 
be easily selected using a selective agent. Transformed 
plant cells thus selected can be induced to differentiate 

20 into plant structures which will eventually yield whole 
plants. The selection agent can preferably be a 
herbicide such as glufosinate, glyphosate or TBL and the 
selection marker can be any gene conferring to the plant 
cell resistance to the herbicide. This approach has the 

25 added benefit that the resulting plant would also be 
herbicide resistant. 

The use of Cauliflower Mosaic Virus (CaMV) (Howell, 
S, H., et al, , 1980, Science 208: 1265) and gemini 
viruses (Goodman, R. M., 1981, J. Gen. Virol. 54: 9) as 

30 vectors has been suggested but by far the greatest 
reported successes have been wich . Aarobacteria sp. 
(Horsch, R. B., et al . , 1985, Science 227: 1229-1231). 
Methods for the use of Agrobacterium based transformation 
systems have now been described for many different 

35 species. Generally strains of bacteria are used that 
harbour modified versions of the naturally occurring Ti 
plasmid such that DNA is transferred to the host plant 
without the subsequent formacion of tumours. These 
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methods involve the insertion within the borders of the 
Ti plasmid the DNA to be inserted into the plant genome 
linked to a selection marker gene to facilitate selection 
of transformed cells. Bacteria and plant tissues are 
cultured together to allow transfer of foreign DNA into 
plant cells then transformed plants are regenerated on 
selection media. Any number of . different . organs and 
tissues can serve as targets for Aarobacterium mediated 
transformation as described specifically for members of 
t-he Brassicaceae . These include thin cell layers 
(Charest, P. J., et al., 1988, Theor. Appl . Genet. 
75:438-444), hypocotyls (DeBlock, M. , et al . , 1989, Plant 
Physiol. 91:694-701), leaf discs (Feldman, K.A., and 
Marks, M. D., 1986, Plant Sci . 47:63-69), stems (Fry J, , 
et al-. Plant Cell Repts. 6:321-325 (1987), cotyledons 
(Moloney M.M. , et al . , Plant Cell Repts. 8:238-242 (1989) 
and embryoids (Neuhaus, G., et al., Thoer. Appl, Genet. 
75:30-36 (1987), It is understood, however, that it may 
be desirable in some crops to choose a different tissue 
or method of transformation. 

It is also understood that the successful 
transformation and recovery of a plant that contains 
these recombinant sequences may not always result in 
appropriate pollen specific expression. The 
transformation procedure results in the random insertion 
of foreign DNA such that "position effects" may override 
and suppress the activity of any introduced DNA. It is 
thus advisable to generate a number of individual 
transformed plants with any recombinant construct in 
order to recover individuals free from any limiting 
position effects. It may also be preferable to select 
plants that contain more than one copy of the introduced 
recombinant DNA molecule such that high levels of 
expression of the recombinant molecule are obtained. 

It is known that the number of species of plants that 
have been successfully genetically transformed still 
represents a modest percentage of the total number of 
plant species that are of potential commercial interest. 
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It is true, however that the number of species that have 
been transformed has increased steadily and there is 
every reason to expect that transformation systems can be 
developed for any crop of interest in due course . 
5 Routine transformation was initially achieved with 
species from two plant families: . Solanaceae and 
Brassicaceae . Examples of species of commercial interest 
from within these families that have been transformed 
include: tobacco, Nicotiana tabacum L. tomato, 

10 Lvcopersicon esculentum Mill, potato, Solanum tuberosum 
L,, and petunia. Petunia hybrida ( Solanaceae ) : 
Canola/Rapeseed, Brassica napus L. , cabbage, broccoli, 
kale etc, Brassica oleracea L., mustards, Brassica 
iuncea L . , Brassica nigra L . , and Sinapis alba L . 

15 ( Brassicaceae ) . 

Recently transf oirmation has been reported of 
commercially important species from other families such 
as sugar beet. Beta vulgaris , ( Chenopodiaceae ) , cucumber, 
Curcurbita sp . ( Curcurbitaceae ) . cotton, Gossvpium sp., 

20 ( Malvaceae ) , sunflower, Helianthus annuus and lettuce 

Lactuca sativa, ( Asteraceae ^ Compositae ) , and pea. Pi sum 
sativum , soybean. Glycine max and alfalfa, Medicago sp 
( Fabaceae = Ijeguminoseae ) • Transformation has also been 
achieved with tree species such as poplar, Populus sp. 

25 ( Salicaceae ) and walnut, Juglans nigra . ( Juglandaceae ) , 

Transformation success with monocotyledonous species 
has not progressed as rapidly, since these species are 
generally not very susceptible to Agr oba c t e r i urn mediated 
transformation. However, progress which has been 

3 0 noteworthy includes asparagus. Asparagus officinalis ; 

gladiolus, Gladiolus sp. , ( Lilaceae ) ; corn, Zea mavs and 
rice, Orvza sativa (Poaceae) . The recent discovery that 
transformation with Agrobacterium can be accomplished by 
infecting germinating seeds without the requirement of 

35 regeneration from cell culture (Chee, P.P., et al . , Plant 
Physiol. 91:1212-1218 (1989) opens new horizons for 
species that may be difficult to regenerate. 
Additionally, widespread studies on the use of particle 
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10 



15 



guns to transfer microprojectiles coated in DNA into 
plant cells of species that are not readily susceptible 
to other methods holds great promise. It is expected 
that the present invention may be carried out with any 
one of the above species and with any other species that 
is capable of being genetically transformed. 

The most widely used and generally successful methods 
of introducing foreign DNA into plants are dependent upon 
the use of an infectious agent, such as the Aarobacterium 
tumef aciens Ti plasmid, as a vector for delivery of the 
foreign DNA as hereinbefore described. However, it may 
be possible to use other methods, as well. Other methods 
that have been employed involve mechanical means such as 
direct DNA uptake, liposomes, electroporation (Guerche, 
P. et ai., 1987, Plant Science - 52 : 111-16) and micro- 
injection (Neuhaus, G. , etal., 1987, Theor. Appl , Genet. 
75: 30-36). Recently the possibility of using 

microprojectiles and a gun or other devise to force small 
metal particles coated with DNA into cells has received 
considerable attention (Klein, T.M. et al . , 1987, Nature 
327: 70-73). To date success with this and other 
mechanical methods has not been widely reported. The 
method of choice will be dependant on the particular 
plant species and will be apparent to those skilled in 
25 the art. In those plant species where a successful 
transformation has not yet been demonstrated, it is 
anticipated that any new method of transformation 
developed may be used to insert the recombinant DNA 
molecules of the invention and the actual method of 
3 0 insertion of the genes will have little or no effect on 
the functioning of the crop production systems described 
herein. 

It may also be possible to produce plants which are 
male sterile or carry the male sterile trait by fusing 
35 cells of a plant cell line containing cells having one or 
more recombinant DNA molecules of the invention with 
cells of plant species that cannot be transformed by 
standard methods. A fusion plant cell line is obtained 



20 
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that carries a genetic component from both plant cells. 
Fused cells that carry the recombinant DNA molecule can 
be selected and in many cases regenerated into plants 
that are male sterile or carry the male sterile trait. 

It is contemplated that some embodiments of the 
present invention may require that a plant cell be 
transformed with a recombinant DNA molecule containing at 
least two DNA sequences or be transformed with more than 
one recombinant DNA molecule. The DNA sequences or 
recombinant DNA molecules in such embodiments may be 
physically linked, by being in the same vector, or 
physically separate on different vectors. A cell may be 
simultaneously transformed wlch more than one vector 
provided that each vector has a unique selection marker 
gene. Alternatively, a cell may be transformed with more 
than one vector sequentially allowing an intermediate 
regeneration step after transformation with the first 
vector. Further, it may be possible to perform a sexual 
cross between individual plants or plant lines containing 
different DNA sequence or recombinant DNA molecules. 
Preferably the DNA sequences or the recombinant molecules 
are linked or located on the same chromosome. It would 
then be possible to select from the progeny of the cross, 
plants containing both DNA sequences or recombinant DNA 
molecules . 

Where at least two recombinant DNA molecules are 
necessary to interfere with the function and/or 
development of a cell of a plant that is essential to 
pollen formation and/or function (e.g., a first 
recombinant DNA molecule contains a first DNA sequence 
encoding a first gene product which converts a non-toxic 
substance to a cytotoxic substance and the non- toxic 
substance is introduced into the cell by means of a 
second recombinant DNA molecule which contains a second 
DNA sequence which encodes a second gene product which 
convercs a substance endogenous co a plant cell co che 
non-coxic substance) . che recombinant DNA molecules may 
be linked co prevenc segregation of DNA sequences 
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required to produce the desired effect. Alternatively, 
a male sterile plant can be produced by crossing, two 
plant lines, each containing one of the recombinant DNA 
molecules, where the plant lines are otherwise isogenic. 
Each plant line is preferably made homozygous, for the 
respective recombinant DNA molecules to ensure that all 
the progeny receive a copy of each of the recombinant DNA 
molecules. Further, in some hybridization schemes 
discussed below, it may be preferable to have the 
respective recombinant DNA molecules located in the same 
chromosome pair in each line. The choice of a chromosome 
pair containing a first recombinant DNA molecule in one 
plant line may be predetermined, randomly, and the other 
plant line may be adjusted so that a second recombinant 
DNA molecule is located on the same chromosome pair. For 
example, to produce a plant line which is homozygous for 
a first recombinant DNA molecule, a chromosome pair, in 
which a second recombinant DNA molecule is located is 
identified in a plant line, a transformed cell in which 
the first recombinant DNA molecule is incorporated on the 
identified chromosome pair is then selected, a plant is 
regenerated from the transformed cell, the plant is 
selfed, and a plant is selected which is homozygous for 
the trait encoded by the first recombinant DNA molecule, 
and the number of homozygous planes are increased, by 
selfing in isolation. 

Methods of identifying the chromosome pair in which 
a recombinant DNA molecule is located and methods for 
producing homozygotes are discussed in detail below. 

As indicated above, it may be desirable to produce 
plant lines which are homozygous for a particular gene. 
In some species this is accomplished rather easily by the 
use of anther culture or isolated microspore culture. 
This is especially true for the oil seed crop Brassica 
napus (Keller and Armstrong, Z. Pf lanzenzucht 80: 100- 
108, 1978) . By using these techniques, it is possible to 
produce a haploid line that carries the inserted gene and 
then to double the chromosome number either spontaneously 
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or by the use of colchicine. This gives rise to a plant 
that is homozygous for the inserted gene, which can be 
easily assayed for if the inserted gene carries with it 
a suitable selection marker gene for detection of plants 
carrying that gene. Alternatively, plants may be self- 
fertilized, leading to the production of a mixture of 
seed that consists of, in the simplest case, three types, 
homozygous (25%) , heterozygous (50%) and null (25%) for 
the inserted gene. Although it is relatively easy to 
score null plants from those that contain the gene, it is 
possible in practice to score the homozygous from 
heterozygous plants by southern blot analysis in which 
careful attention is paid to the loading of exactly 
equivalent amounts of DNA from the mixed population, and 
scoring heterozygotes by the intensity of the signal from 
a probe specific for the inserted gene. It is advisable 
to verify the results of the southern blot analysis by 
allowing each independent transformant to self -fertilize, 
since additional evidence for homozygosity can be 
obtained by the simple fact that if the plant was 
homozygous for the inserted gene, all of the subsequent 
plants from the selfed seed will contain the gene, while 
if the plant was heterozygous for the gene, the 
generation grown from the selfed seed will contain null 
plants. Therefore, with simple selfing one can easily 
select homozygous plant lines that can also be confirmed 
by southern blot analysis. 

Two techniques may be used to produce plant lines 
which carry genes that segregate in a similar fashion or 
are on the same chromosome or a set of chromosome pairs. 
One may be a simple crossing strategy in which two 
transf ormants that are homozygous for a single inserted 
gene are crossed to produce Fl seed. The progeny plants 
from the Fl seed (Fl plant generation) may be crossed 
with a recipient plant and the segregation of the two 
inserted genes is determined (F2 plant generation) . For 
example, where the lamH and lamS genes are the inserted 
genes, the Fl plants grown from the Fl seed will be male 
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Sterile. If the original transf ormants are homozygous 
for a single inserted gene, when crossed with a -non- 
transformed plant to produce F2 seed, the F2 plants will 
be 100% male fertile if the two transf ormants originally 
used for the production of the Fl seed carried the lamH 
and the lamS genes on the same chromosome or in the same 
linkage group. . ..if the genes are in. separate linkage 
groups or on different chromosomes, a variable degree of 
male sterility will be seen, in theory 25% of the plants 
will be male sterile if the genes segregate completely 
independently of each other. This approach allows for 
the selection of breeding lines from the homozygous 
transformed plant lines that contain the lamS and lamH 
genes which will segregate substantially 100% in the 
hybrid seed sold for commercial use. 

An alternative strategy may make use of extensive 
genetic maps available for many commercially grown crops 
and the many easily scoreable markers that .are known for 
most linkage groups or chromosomes. In some cases, 
linkage groups and chromosomes may be equivalent, whereas 
in others, there may be more than one linkage group 
assigned to each chromosome. When there is a marker for 
each chromosome, identification of the chromosome into 
which the recombinant gene has been inserted is 
relatively simple. A cross is made between each 
individual transformant and a recipient plant that allows 
for visualization of the marker (s) . 

If there are scoreable markers that have been 
localized to each of the chromosomes in the plant, and 
the markers are scoreable in the generation produced by 
this cross, one can localize the segregation of the 
inserted gene with the marker, thereby establishing the 
chromosomal location of that gene. This therefore allows 
for the chromosomal or more importantly the linkage group 
with which the inserted gene segregates. Many crops such 
as com, tomato and many cereal crops have extensive 
genetic maps that allow for the identification of the 
chromosome containing the inserted gene. It is 
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contemplated that as more detailed chromosome maps are 
made, especially with the use of RFLP (restriction 
fragment length polymorphism) maps, the assignment of 
inserted genes to particular chromosomes will easily be 
5 done for most commercial crop species. 

As a means of confirmation, or in plant species where 
chromosomal markers are not known, it is possible to use 
a technique called pulse- field electrophoresis 
(originally described by Schwartz and Cantor, Cell, 37: 

10 p67; 1984) to determine if different transformed plants 
contain inserted genes on the same chromosome. Pulse- 
field electrophoresis is a technique that can separate 
large DNA pieces, even chromosomal size, into a 
reproducible pattern on a gel* When this is done, it is 

15 possible to process this gel such that the chromosome 
spots can be analyzed by southern blotting techniques, 
localizing the inserted gene to a chromosome spot. When 
the entire population of primary transf ormants are 
analyzed in this fashion, it is a simple task to choose 

20 the two transf ormants that carry the inserted genes on 
the same chromosome spot . 

As hereinbefore mentioned, after the recombinant DNA 
molecule (s) is introduced into the genome of a plant 
cell, a plant cell having a recombinant DNA molecule 

25 stably integrated in its genome is selected. This 
selection step may be facilitated by incorporating a 
selection marker gene in the recombinant DNA molecule 
such that plant cells transformed with the recombinant 
DNA molecule may be easily selected using a selective 

3 0 agent as hereinbefore described. A selection marker gene 
is neomycin phosphotransferase (NPT II) which confers 
resistance to kanamycin and the antibiotic G-418. Cells 
transformed with this selection marker gene may be 
selected for by testing in vitro phosphorylation of 

35 kanamycin using techniques described in the literature or 
by testing for the presence of the mRNA coding for the 
NPT II gene by Northern bloc analysis in RNA from the 
tissue of the transformed plant. A preferred selective 
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agent and selective marker would be a herbicide and its 
herbicide resistance gene(s) as described hereinbefore. 

Expression of the recombinant DNA molecules of the 
invention in transformed plant cells may be monitored 
using Northern blot techniques. Single stranded RNA 
probes which are homologous to transcripts of the coding 
sequences of a recombinant DNA molecule (s) may be used to 
detect for the presence of the recombinant DNA molecule 
in a plant cell or tissue so that the expression of the 
coding sequence can be ascertained. It is preferable to 
use agarose gel electrophoresis to separate transcripts- 
according to size under denaturing conditions. In the 
case where pollen specific gene expression of the coding 
sequence is sought to be accomplished it is advisable to 
test for the expression of the coding sequence in the 
cell type whose function and/or development is to be 
interfered with and in other tissues, such as leaves, 
roots, etc., so that tissue specific gene expression of 
the recombinant DNA molecule in such cell types can be 
verified. 

The presence of a stably integrated recombinant DNA 
molecule in the genome of the plant cell may also be 
ascertained by using Southern blot techniques. In this 
procedure, total cellular or nuclear DNA is isolated from 
the transformed plant or plant cell and preferably 
digested with a restriction enzyme, thereby giving rise 
to discrete fragments . These discrete fragments may be 
detected in the nuclear or total DNA of the transformed 
plant or plant cells by employing standard gel 
electrophoresis, followed by nucleic acid hybridization 
techniques • 

The formation of microspores in plants which contain 
the recombinant DNA molecule (s) of the invention and 
which are male sterile, is first monitored by visual 
microscopic examination of the anther structure. As 
maturation of the flower occurs, anther formation is 
expected to be delayed or completely inhibited such that 
no mature pollen grains are formed or released. 
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It will be appreciated that plants produced using the 
method for producing male sterile plants of the present 
invention may display varying degrees of male sterility. 
This may be a result of the nature of the plant cell 
transformed or of the chance positioning of the 
recombinant DNA molecule in the genome of the plant cell . 

The following description of the invention sets 
forth, in general terms, the steps that can be employed 
to increase the number of male sterile plants and plants 
carrying the male sterile trait and to produce restorer 
plants, hybrid seed, hybrid seed with restored fertilif- 
and hybrid seed yielding a mixture of male fertile and 
male sterile plants. It is to be understood that these 
various steps may be accomplished by a variety of 
different procedures. In the following description of 
procedures, alternative ways to accomplish these steps 
are disclosed. Preferred procedures may be detailed 
below and in the examples. However, it is contemplated 
that other variations will be apparent to those skilled 
in the art . 

As hereinbefore mentioned, the invention relates to 
hybrid seed having a genome comprising one or more of the 
recombinant DNA molecule (s) of the invention for 
producing plants which are male sterile or carry the male 
sterile trait and to seed of plants grown from the hybrid 
seed. The invention is also directed to a mixture of 
fertile and sterile hybrid seed and to seed of plants 
grown from said hybrid seed. The invention is also 
directed to hybrid seed having a genome comprising one or 
more of the recombinant DNA molecules of the invention 
for producing a male sterile plant and a corresponding 
restorer gene product for restoring fertility in the seed 
and to seed of plants grown from the hybrid seed. 

To produce hybrid seed on a commercial scale from a 
plant which is male sterile or carries the male sterile 
traic, Che number of such planes must be increased or 
maintained and crossed with a suitable male fertile 
parent line. 
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A plant which carries the male sterile trait may be 
more readily maintained because such plant will .self- 
pollinate unless treated with a sterility actuating agent 
which renders the plant substantially sterile. For 
example, in a plant containing a recombinant DNA molecule 
having a DNA sequence which encodes a gene product that 
renders a cell susceptible to a chemical" agent or 
physiological stress and having a promoter essential for 
pollen development and/or function, the plant will be 
rendered substantially male sterile by treating the plant 
with an actuating agent, namely the chemical agent or 
physiological stress. Accordingly, plants which carry 
the male sterile plant are preferably maintained by 
selfing, selecting from the selfing progeny a plant which 
is homozygous for the male sterile trait, and increasing 
the number of plants which are homozygous for the male 
sterile trait by selfing in isolation over a number of 
generations. Selection for a plant which is homozygous 
for the male sterile trait may also be carried out by the 
procedures discussed above. 

A plant which is substantially male sterile is 
preferably maintained by crossing the male sterile plant 
with a suitable male fertile plant, obtaining seed from 
plants resulting form the cross, growing plants from the 
seed and selecting the plants which are male sterile from 
among the plants grown from the seed. This process may 
be repeated over a number of generations until the 
desired number of male sterile plants is obtained. 

A plant which carries the male sterile trait may also 
be maintained by the latter method. 

To facilitate selection of male sterile plants in the 
"maintained" plants, the recombinant DNA molecule (s) used 
to produce the male sterile plant should preferably 
comprise a selection marker gene and preferably the 
section marker gene is linked to a DNA sequence (s) 
contained in the recombinant DNA molecule as hereinbefore 
described. Such selection marker could be a herbicide 
resistance gene. 
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A suitable male fertile plant for the purpose of 
crossing with a male sterile plant to increase the number 
of male sterile may be, but is not limited to, a plant of 
the same inbred line from which the male sterile plant is 
derived. In some instances referred to below, 

maintenance of the male sterile line can be produced 
simply by selfing in isolation. 

According to another scheme the number of male 
sterile plants may be increased by clonal propagation 
using tissue explants thereof, or other in vitro 
propagation techniques. 

Where cost is warranted, and maintenance cannot be 
readily accomplished as discussed above, transformed 
plant cells may be grown in culture according to routine 
methodology to produce a cell line. A cell line may be 
regenerated according to routine methodology to increase 
and maintain the male sterile cell line. Routine methods 
for culturing cell lines and regenerating transformed 
plants from cell lines is described in standard plant 
tissue culture hand books. ( Plant Tissue and Cell 
Culture , Green, C..E., Somers, D. A., Hackett W. P., and 
Biesboer, D. D. Eds, 1987, Alan R. Liss, Inc., New York, 
Experiments in Plant Tis sue Cul mi^>> Dodds, J. .H. and 
Roberts, L. W. Eds, 1985, Cambridge University Press, or 
SSlI — Structure and .g omatic Cell Genetics of Plants . 
Vasil, I. K., Scowcroft, W.R., and Frey K. J . ,. Eds . , 1984, 
Academic Press, New York, Handbook of Piann Cell Culcur-^ . 
Vol 1-4, Evans, D. A. , Sharp, W. R. , Ammirato, P. V. , and 
Yamada, Y. Eds ., 1984-1986 , Macmillan, New York, Bio- 
technology in AaricuT ture and Forest: ry . Vol 1 and 2, 
Bajaj, Y. P. S. Ed., 1986, Springer- Verlag, Berlin, or 
SlMn^ — Propagation bv Tissue Cultiure - Handbook and 
Directory of Commereial Labors r. ot- -i >:>g George, E. F. , and 
Sherrington, P.' D.. 1984, Eastern Press, Reading). 

Production of hybrid seed may be accomplished by 
pollination of male scerile plants with pollen derived 
from selected male fertile plants. Pollination can be by 
any means, including but not iimiced to hand, wind or 
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insect pollination, or mechanical contact between the 
male fertile and male sterile plant. For production of 
hybrid seeds on a commercial scale in most plant species 
pollination by wind or by insects are preferred. 
Selection of plants for pollen donation is determined by 
standard crossing of different plants with subsequent 
analysis of the progeny and selection of lines with the 
best combining ability and superior agronomic traits. 
Restoration of fertility in the hybrids may be 
accomplished by using the methodology detailed below. 

For certain crops of interest, such as' vegetables, 
it may only be the leaves, stems or roots of the plant 
that are sold commercially. Therefore, even though the 
recombinant DNA molecule (s) which renders the plant male 
sterile may be inherited and expressed in the hybrid 
plant it is not necessary to overcome or restore male 
fertility in the seed of the hybrid plant. However, for 
other crops, the commodity of commerce may be the seed or 
fruit produced. by the hybrid plant. Thus for optimal 
commercial utility of the hybrid it may be desirable to 
produce hybrid seed that is fertile. 

The invention contemplates a variety of recombinant 
DNA molecules that may be used to produce a male sterile 
plant. In a scheme where any such recombinant DNA 
molecule (s) consists of a single gene or linked genes, 
the gene(s) may segregate as a unit to produce a mixture 
of sterile hybrid seed and fertile hybrid seed. 
Similarly unlinked recombinant molecules may segregate to 
produce a mixture of fertile and sterile seed except, as 
discussed below, where the molecules are located on 
different chromosomes of the same chromosome pair. In 
outcrossing species, the plants which grow from the 
fertile seed present may allow full pollination of the 
male sterile hybrid plants. Therefore, it may be 
preferable to adopt a scheme to produce hybrid seed which 
is fully fertile. However in species which are 
relatively weakly outcrossing, male sterile hybrid plants 
will not be fully pollinated, thereby reducing the seed 
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yield. Therefore, where the commodity of commerce is the 
seed produced by the hybrid plant, it may be desirable to 
produce hybrid seed which is fully fertile. 

The invention contemplates methods for restoring 
fertility in hybrid plants produced in accordance with 
the methods of the invention. 'Specifically, the 
invention contemplates a method for restoring hybrid 
plants produced in accordance with the methods of the 
invention by incorporating into a male parent plant, a 
restorer recombinant DNA molecule containing a restorer 
gene ^^hich encodes a restorer gene product which 
compensates for a gene function that has been compromised 
by a gene product (s) encoded by a first recombinant DNA 
molecule (s) of the invention, or which negates the effect 
caused by a gene product (s) encoded by a first 
recombinant- DNA molecule (s) of the invention, as 
discussed below. The male parent plant thus may act as 
a "restorer plant". The restorer plant may be made 
homozygous for the gene encoding the restorer trait to 
ensure that all progeny inherit a gene. The homozygous 
restorer plant may be "maintained" by selfing in 
isolation to produce a "restorer line". Selection for a 
plant that is homozygous for the restorer trait may be 
carried out, as described above, by conducting anther or 
isolated microspore culture of the genetically 
transformed plant carrying the restorer trait, or 
preferably by selfing the plant in isolation prior to 
selection. 

The expression of a restorer gene may be regulated 
by any promoter that is active during the period of 
transcription of a first recombinant DNA molecule which 
encodes a gene product which substantially interferes 
with the function and/or development of a cell essential 
to pollen formation and/or function. It is preferred 
that the expression of the restorer gene be regulated by 
the same promoter used to regulate the expression of the 
fi^rst recombinant DNA molecule or any promoter that is 
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highly active in tissues that are essential to pollen 
formation and/or function. 

Methods for restoring fertility in hybrid plants 
which have integrated in the genome of their cells 
recombinant DNA molecules of the invention are discussed 
below. 

The action of a protein or polypeptide which is 
substantially cytotoxic to a cell of a plant which is 
essential to pollen formation and/or function, and which 
is encoded by a DNA sequence contained in a recombinant 
DNA molecule which is integrated in the genome of cells 
of a hybrid plant may be negated by regulating the 
expression of the DNA sequence encoding the protein or 
polypeptide. For example, an anti- sense gene to the gene 
encoding the protein or polypeptide may be incorporated 
into a male parent line. 

In particular where the recombinant DNA molecule has 
a DNA sequence which encodes a protein or polypeptide 
which is substantially cytotoxic to a cell of a -plant 
which is essential to pollen formation and/or function, 
the male sterile plant may be crossed with a suitable 
male fertile plant that has been previously transformed 
with a restorer recombinant DNA molecule. The restorer 
recombinant DNA molecule may contain a restorer DNA 
sequence which is in the anti -sense orientation to that 
of the DNA sequence encoding the cytotoxic protein or 
polypeptide and a promoter controlling the restorer DNA 
sequence which activates transcription of the restorer 
DNA sequence at about the time of transcription of the 
DNA sequence. The restorer gene product inhibits the 
expression of the DNA sequence encoding the protein or 
polypeptide in the hybrid plant. The promoter 

controlling the expression of the restorer DNA sequence 
is preferably the same promoter essential for pollen 
development and/or function that controls the expression 
of the cytotoxic protein or polypeptides. Inducible and 
constitutive promoters may also be advantageously used to 
control the expression of the restorer DNA sequence . 
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A cytotoxic method of producing male sterile plants 
may involve the synthesis of proteins or polypeptides 
capable of substantially interfering with the function 
and/or development of a cell/tissue which is essential to 
pollen formation and/or function. Methods for restoring 
fertility in male sterile plants produced by cytotoxic 
methods involve the synthesis, in cells/tissues that are 
essential to pollen formation or function, of restorer 
gene products that specifically block, neutralize or 
destroy the proteins or polypeptides. If the recombinant 
DNA molecule integrated in the genome of a hybrid plant- 
has a DNA sequence which encodes a cellular toxin, a 
restorer plant containing DNA sequences coding for a 
detoxifying molecule may be used to restore fertility. 
If the recombinant DNA molecule integrated in the genome 
of a hybrid plant has a DNA sequence which encodes a 
destructive enzyme, a restorer plant containing DNA 
sequences coding for a specific enzyme inhibitor may be 
used to restore fertility. if the recombinant DNA 
molecule integrated in the genome of a hybrid plant has 
a DNA sequence which encodes a cytoplasmic disrupting 
molecule, a restorer plant containing DNA sequences 
encoding a specific peptidase may be used to restore 
fertility. Judicious choice of the geneexpression 
regulation mechanism (s) for the male sterility producing 
agent would allow the use of gene expression mechanisms 
as restorer gene sequences. One example would be the use 
of the lexA mechanism ( Brenn, R. and Ptashne, M. S., 
Regulation — of — Eukarvnl- jc aene exr>ressir.n Patent* 
4,833,080. 1989). Specific examples of cytotoxic 
proteins or polypeptides and their restorer gene products 
are the proteinase enzyme trypsin and soybean or cowpea 
trypsin inhibitor; ribonuclease and a ribonuclease 
inhibitor; or a starch degrading enzyme such as alpha- 
amylase and an alpha-amylase inhibitor. 

Fertility in a hybrid plant containing in the genome 
of its cells a recombinant DNA molecule having an anti- 
sense gene may be restored as follows. An anti-sense 
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gene contained in a recombinant DNA molecule to be 
incorporated in the genome of a plant may be selected 
such that it contains sequences of a sense gene which are 
transcribed, but not translated, in an anti- sense 
5 orientation. The transcribed but untranslated sequence 
may include an untranslated 5' leader sequence, 
intervening sequences and an untranslated 3' sequence, or 
any substantial fragments of these sequences. It is to 
be understood that the sequences or fragments thereof may 

10 be naturally occurring or foreign sequences. 

Thus, any hybrid plr nt produced from a cross with a 
plant containing such an anti-sense gene may be restored 
by crossing with a restorer plant that has been 
transformed with a modified form of the initially 

15 targeted gene which does not contain the regions 
complementary to the anti -sense gene and therefore is not 
subject to the anti-sense regulation. 

As described above, a plant carrying a male sterile 
trait may be produced by integrating various recombinant 

20 DNA molecules of the invention into the genome of a plant 
cell and regenerating the plant from the plant cell. The 
development and/or function of cells that are essential 
for pollen formation and/or function in the plant are 
interfered with only after the plant is exposed to a 

25 sterility actuating agent such as an inducer, a toxic 
agent or a cryptocytotoxic substance. Thus, restoration 
is inherent in growing plants produced from hybrid seed 
in the absence of the sterility actuating agent. 

Where more than one recombinant DNA molecule of the 

30 invention is used to produce a male sterile plant, the 
recombinant DNA molecules may be inserted in the same 
chromosome pair in separate isogenic plant lines. The 
respective lines are preferably made homozygous for the 
respective recombinant DNA molecule (s) /gene prior to 

35 crossing the lines to produce a male sterile plant. 
Where a first and a second recombinant molecule are 
integrated into the same chromosome in the isogenic plant 
lines, a cross of these lines results in the first and 
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second recombinant DNA molecules being located on 
separate chromosomes of the same chromosome pair in the 
male sterile plant. Consequently, when the male sterile 
plant is crossed with a suitable male fertile plant of a 
different line, both chromosomes of the chromosome pair 
segregate into separate Fl progeny with. the result that 
the first and -second recombinant DNA molecules are not 
expressed in the same plant. Thus, the Fl hybrid seed is 
fully fertile. If the two recombinant DNA molecules are 
integrated into different chromosomes in the male sterile^ 
plant, then a portion of the Fl hybrid -^eed will be male 
sterile since there is a 25% probability of co- 
segregation of the chromosomes containing both 
recombinant DNA molecules into the male sterile plant. 
This latter approach may be advantageous with respect to 
outcrossing species. When the Fl male fertile plants 
outcross, a portion of the F2 seed will inherit both 
chromosomes containing the first and second recombinant 
DNA molecules and consequently will be male sterile. 
Where the seed is the commodity of commerce, it is 
advantageous for seed producing companies to use a scheme 
for hybrid seed production, where the saving of Fl hybrid 
seed is discouraged. The outcrossing in the Fl hybrid 
plants results in partial male sterility in the F2 
generation, thereby reducing the seed yield of F2 plants, 
which is commercially desirable. An example of this 
method is as follows: a first male sterile plant line 
incorporating in its genome a recombinant DNA molecule 
having an lamH gene encoding lamH which converts non- 
toxic lAM to toxic levels of lAA, may be crossed with a 
second plant line having a genome incorporating a second 
recombinant DNA molecule having an lamS gene which 
converts tryptophan to lAM. 

In a preferred method of the invention for producing 
hybrid seed a first male sterile plant line having a 
genome xncorporating a recombinant DNA molecule having a 
first DNA sequence encoding a protein or polypeptide 
which, renders a non-toxic substance substantially 
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cytotoxic to a cell of a plant which is essential to 
pollen formation and/or function and a promoter essential 
for pollen development and/or function is crossed with a 
second plant line which contains a second recombinant DNA 
molecule having a second DNA sequence which encodes a 
second gene product which converts a substance which is 
endogenous to a plant cell to the non- toxic substance. 
Preferably, the first and second plant lines used in this 
method are isogenic and each line carries a homozygous 
loci for the first DNA sequence or the second DNA 
sequence. Most preferably the first and second DNA 
sequences are located on the same chromosome pair of the 
plant lines, such that in any cross of the two lines a 
single chromosome pair contains both the first and second 
DNA sequences. The first plant line is made male sterile 
by exposing the first plant line to the non-toxic 
substance. The protein or polypeptide encoded by the 
recombinant DNA molecule incorporated in the genome of 
the first plant line will render the non- toxic substance 
toxic in cells of the plant which are essential to pollen 
formation and/or function, thus producing a male sterile 
plant line. The male sterile plant line also preferably 
has a selection marker gene linked to the first DNA 
sequence encoding the protein or polypeptide which 
renders a non-toxic substance cytotoxic to facilitate 
harvesting of the seeds having cells containing the first 
and second DNA sequences . 

When the first male sterile plant line and the second 
plant line are crossed, the first male sterile plant line 
produces seeds having cells containing the DNA sequence 
encoding the non-toxic substance (e.g., JAM) and the DNA 
sequence encoding the protein or polypeptide (lamH) which 
renders the non-toxic substance cytotoxic (e.g., lamH 
converts lAM to toxic level of lAA) . The seed having 
cells containing the first and second DNA sequences will 
produce male sterile plants which may be pollinated with 
a male fertile line to produce commercial hybrid seed. 
If the first and second DNA sequences are located on the 
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same chromosome or in the same linkage group, the DNA 
sequences will segregate completely in the Fl hybrid seed 
and the hybrid seed will be substantially male fertile. 

Advantage is taken in the above-mentioned preferred 
method of the fact that most plant species produce, per 
plant, many hundreds of seeds. In oilseed Brassica for 
example, one plant, under normal conditions can produce 
one thousand seeds. Using the method described above, 
one can expect a thousand- fold increase in seeds per unit 
area sprayed with the non- toxic substance. That is to 
say that, for example, when two isogenic lines arr 
produced that carry the lamS and lamH genes, the first 
pre-production step involves the use of NAM to cause male 
sterility in the plant line that carries only the lamH 
gene. When cross pollinated with the pollen from the 
plant that contains the lamS gene, one can expect up to 
one thousand seeds per unit area, each seed capable of 
growing into a male sterile plant . When these seeds are 
planted and crossed with a male fertile plant, one can 
expect one thousand seeds per unit area. Therefore, if 
one were to plant on^ acre of the plant line carrying the 
lamH gene and the pollinator carrying the lamS gene, this 
acre would need to be sprayed with NAM. From this one 
acre however, enough seed would be obtained to grow 1000 
acres of male sterile plants and pollinators, and from 
these 10 00 acres, enough hybrid seed with restored 
fertility would be obtained to plant 1,000,000 acres of 
hybrid crop. The amount of management required to 
produce this hybrid seed is reduced over conventional 
methods because of the pre-production amplification step 
employed. If the lamH gene is linked to a herbicide 
resistance gene, one can plant the fields randomly to 
ensure high rates of cross pollination and use the 
herbicide to kill the pollinator plants after flowering. 
This method therefore allows for efficient hybrid seed 
production over methods where hybrid seed is harvested 
directly following the first cross pollination. 
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A particularly preferred embodiment of the above 
method is described in more detail below with reference 
to Figures 17 and 18. As illustrated in Figure 17, the 
method employs two plant lines which are homozygous, 
respectively, for the lamH gene (plant line A2) and the 
Jams (plant line Al) genes and otherwise isogenic. These 
genes are located on the same chromosome pair of the 
plant. Accordingly, plants produced from a cross of 
these two isogenic lines will contain the lamS and the 
lamH gene respectively on different chromosomes of a 
single chromosome pair. This will ensure that the two 
genes will segregate when this plant is crossed with a 
male fertile plant. To produce the hybrid seed, a two 
step procedure is used. The first step involves a pre- 
production of an isogenic male sterile line, the second 
step is the hybrid seed production itself. To accomplish 
the first step the following approach is used: The two 
isogenic lines Al and A2 are planted in rows as shown, 
and when flowering starts, the plots are sprayed with NAM 
(naphthalene acetamide) . This chemical is a non-toxic 
version of the plant growth regulator NAA, and the action 
of the lamH gene converts NAM to NAA. Under the control 
of the promoter essential for pollen development and/or 
function, the lamH gene only is expressed in pollen of 
the A2 line, and as such NAA is only made in pollen of 
the A2 line. Since NAA is a plant growth regulator, 
normal anther and microspore development is altered, 
leading to male sterility in the A2 plant line when 
treated with NAM. The plants which contain the lamS gene 
under the control of the promoter essential for pollen 
development and/or function (plant line Al) are not 
affected by the NAM, since they are unable to convert NAM 
to NAA, therefore these plants remain fully male fertile 
and can cross pollinate the A2 plants which have now 
become male sterile after treatment with NAM. On the A2 
line, seed is produced that contains both the lamH and 
lamS genes under the control of promoters essential for 
pollen development and/or function (plant seed A2/A1) . 



wo 96/40950 



- 69 - 



PCT/US96/08692 



The seed produced on the A2 line (plant seed A2/A1) is 
harvested. This harvesting can be done by harvesting 
specific rows. Alternatively, the lamH gene (A2) may be 
linked to a gene for herbicide resistance so that the 
herbicide can be used for the roguing of the plant line 
Al. Herbicide application takes place after flowering 
and will kill the Al plants so that only seed that has 
the genotype A1/A2 is produced. The seed harvested from 
such a field will produce substantially 100% male sterile 
plants. The cross produces plants that express both the 
lamS and the lamH genes only .in the pollen. This leads 
to the conversion of tryptophan, an amino acid normally 
found in plant cells to lAM (indole acet amide) via the 
lamS activity and finally to lAA (indole acetic acid) via 
the lamH activity. The molecule lAA is a plant growth 
regulator not normally found in substantial quantities in 
the developing pollen grains or anther of the plant. 
Since the lAA and the precursor lam are small molecules 
that can be transferred from cell to cell via diffusion 
or active transport, altered growth regulator levels are 
seen throughout the anther. This altered growth 
regulator level leads to abnormality in pollen and anther 
development, producing a male sterile plant. This plant 
can be pollinated with a male fertile line leading to 
commercial hybrid seed. For the production of hybrid 
seed, the male sterile isogenic line can be planted in 
rows along side of a suitable male fertile plant, and the 
hybrid seed produced on the male sterile plant can be 
harvested. If the lamH gene is linked to a herbicide 
resistance gene, harvesting of hybrid seed is facilitated 
by using the herbicide to eliminate pollinator plants 
after cross pollination. The entire field can then be 
combined. All seed produced will therefore be hybrid. 
If the two genes (lamS and lamH) are located on the same 
chromosome or in the same linkage group, these two genes 
will segregate completely in the Fl hybrid seed. Since 
the plants will contain either the lamS or the lamH gene, 
but not both, the seed produced by this hybrid cross will 
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be substantially 100% male fertile. Therefore the plants 
grown from the seed of this cross will be fully fertile 
and set normal levels of seed. The F2 seed that results 
from the harvest of this field however will contain a 
variable degree of male sterility, since in theory 12*5% 
or 2 out of 16 of the plants grown from this F2 seed will 
contain both the lamS and the lamH genes, as illustrated 
in Figure 7, Therefore, particularly in poorly 

outcrossing species, the F2 generation will not be 
attractive to plant because up to 12.5% of the plants 
will not produce seed. Similarly, in outcrossing species 
which are not fully pollinated, a corresponding reduction 
in yield will be seen. 

It is contemplated that as a variation of the above 
particularly preferred method, a number of different ways 
of producing the toxic molecule specifically in pollen 
can be envisioned. In all approaches, at least one step 
in the production of the cytotoxic molecule has to take 
place specifically within the pollen cells or anthers. 
For instance, it is possible to use a const itutively 
expressed lamS gene in a plant and to subsequently cross 
that plant with a plant that contains the lamH gene under 
the control of a promoter essential for pollen 
development and/or function such that lam is produced in 
all cells of the plant, but the growth regulator lAA is 
produced only in pollen cells due to the action of the 
pollen specific lamH gene. Conversely , it is possible 
to have lamH constitutively expressed in a plant, and 
cross this plant with a plant that contains a promoter 
essential for pollen development and/or function driving 
the lamS gene. In this situation, the growth regulator 
lAA is only produced in pollen cells. It should be 
cautioned that in this case, one cannot use NAM to induce 
transitory male sterility in the plant that contains the 
lamH gene, since that application of NAM would be lethal 
to the plant. In this case then hand pollination would 
be the preferred way of combining those genes. Wich 
regards to these methods the preferred embodiment of the 
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present invention places both the lamH gene and the lamS 
gene under the control of promoters essential for pollen 
development and/or function and preferably using the same 
promoter essential for pollen development and/or function 
5 or a promoter essential . for pollen development and/or 
function whose expression substantially overlaps that of 
the other to each independently drive the expression of 
these two genes. Additionally, by linking the lamH gene 
to a selectable agent such as a herbicide, hybrid seed 

10 production is greatly facilitated. Any number of genes 
could '•be used to carry out this invention providing that 
the simultaneous production of two or more enzymatic or 
synthetic activities specifically in pollen leads to the 
production of a substance which is toxic or inhibitory to 

15 normal pollen growth or specifically interferes with 
anther or pollen development. This implies that one or 
more of these activities could be constitutive in the 
plant, but that the final combination of all enzyme 
activities be. limited to pollen. It is also envisioned 

20 that one of these activities could be inducible by 
natural or artificial means such that sterility could be 
induced in plants . Specifically one embodiment of this 
method uses a plant line that carries a lamS gene under 
the control of an inducible promoter and a lamH gene 

25 under the control of a promoter essential for pollen 
development and/or function. These genes are preferably 
linked, but could be unlinked. When grown under 
inductive conditions, the plant becomes male sterile and 
can be pollinated by a suitable male fertile plant. The 

3 0 suitable plant could also carry a lamS gene under the 
control of a promoter essential for pollen development 
and/or function such that the progeny of this cross will 
be male sterile. These plants could then be crossed with 
a male fertile plant, producing hybrid seed. This seed 

35 would, depending on the location and number of the 
inserted genes, carry varying degrees of male sterility. 
It is also contemplated that a suitable male fertile 
plant be one that carries one or more copies of an anti- 
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sense gene that is capable of selectively inhibiting the 
expression of one or more of these genes such that 100% 
fertility is recovered in the plants grown from the 
hybrid seed. 

5 Another particularly preferred embodiment of the 

invention uses lamS and lamH genes to produce a male 
sterile parent A as described herein, with the two genes 
introduced in Parent A as one recombinant DNA molecule, 
i.e., linked. This would have the advantage that the two 

10 genes would not segregate in Tl, as would be the case 
with the two genes being introduced into isogenic lines, 
on the same chromosomal pair. It is further proposed 
that the recombinant DNA molecule of this particularly 
preferred embodiment contains, linked to the lamS and 

15 lamH genes, a gene that serves as a selection marker for 
the transformation. It is preferred that the selection 
marker be a herbicide resistance gene to glufosinate 
(commercial names are Basta and Liberty) or glyphosate 
(Roundup) or TBL as discussed hereinbefore. Parent A 

20 would be maintained in a homozygous state for the gene{s) 
producing the male sterile trait (lamS and lamH in this 
embodiment) and for the resistance marker (herbicide 
resistance in this embodiment) by methods of gamete 
transformation as described hereinbefore. Parent A is 

2 5 maintained by crossing with an isogenic, but not 

transformed, parent A and herbicide spraying, as shown in 
Figures 20 and 21. 

Parent B would be the male parent. In a preferred 
case, it would be a plant line that was produced by 

3 0 transformation with the herbicide resistance gene, using 

methods described hereinbefore and made homozygous for 
the herbicide resistance gene. As described in Figure 
21, the hybrid seed would be 100% resistant to herbicide- 
In an alternative embodiment, if a male fertile seed 
3 5 is desired, parent B would be produced by using a 
recombinant DNA molecule containing both male fertility 
restoration and herbicide resistance genes as described 
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in Figure 20. The resulting hybrid seed is 100% 
herbicide resistant and 100% male fertile. 

Of course, parent A could be produced by crossing 
isogenic lines, each containing one of the two genes, 
5 i.e., lamS and lamH, linked to the herbicide resistance 
marker. Both the plant carrying the lamS and the plant 
carrying the lamH genes are made homozygous for the genes 
by methods of transforming anther or microspore cell 
culture or by plant selfing as described herein. This is 

10 illustrated in Figure 22a. Parent A is selected and 
maintained as described^ above and as illustrated in 
Figure 22b. Parent A is crossed with parent B, produced 
as above. The hybrid seed is 100% herbicide resistant 
and, if parent B contains a male restoration gene(s), 

15 then the hybrid seed is 100% male fertile. The plant 
resulting from the cross and the hybrid seed is 100% 
herbicide resistant and either 100% male sterile or 100% 
male fertile, as determined by which recombinant DNA 
molecule was chosen to be introduced in the genome of 

20 parent B. DNA sequences that can serve to restore male 
fertility to a hybrid plant, were one parent was made 
male sterile were discussed hereinbefore. A preferred 
restorer gene would be an anti- sense gene to either gene 
1 and/or gene 2 (lamS and lamH genes, described above) , 

25 The use of anti-sense gene(s) was described herein. 

Another preferred male fertility system in the hybrid 
plant would utilize, gene expression repressor 
mechanism (s) , especially the lexA system discussed 
hereir±>ef ore . A transactivator can also be converted to 

30 a represor to create a dominant negative inhibitor which 
could inhibit the expression of the "target" promoters 
linked to gene 1 and/or gene 2. In yet another preferred 
embodiment, judicious construction of the promoter 
sequences so' as they contain recognition sices for 

35 binding by a transactivator or transdominator would allow 
greater concrol of lamS and/or lamH expression. If the 
transdominacor sequences were themselves indicuble by 
some externally applicable chemical, one could propagate 
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the plant with male sterile genes as a fertile plant 
until the need to utilize such plant as male sterile in 
a hybrid cross . 

To produce a male sterile parent A for the hybrid by 
use of the linked gene 1 and gene 2 scheme described 
above, it would be essential that expression of at least 
gene 1, or gene 2, or both, be regulated by employing a 
promoter active only in cells involved in pollen 
formation/ structure. Such promoters were described 
hereinbefore. A preferred case would employ the anther- 
specific promoters described herein, BplO delivered from 
Brassica gene LIO and/or the anther- specif ic gene from 
tobacco, TA39. Herbicide resistance gene(s) could be 
constitutively expressed. 
15 In Figure 1, a schematic representation of the 

production of the anti-sense vector PAL1302 is shown. A 
plasmid containing the GUS gene (Beta-glucuronidase , 
described in Jefferson, R.A. , Plant Molecular Biology 
Reporter, 1987, 5: 387-405) in the anti-sense orientation 
20 flanked by the CaMV 3 5S promoter and the nos termination 
signal was obtained from the vector pBr221.1 (available 
from Clonetech Laboratories, Palo Alto, CA, USA) . The 
GUS coding sequence found between the CaMV 35S promoter 
and the nos ter of the vector pBI221.1 was excised and 
25 digested with the restriction enzymes Smal and Sstl. The 
SstI site was made blunt -ended using the Klenow fragment 
of DNA polymerase I and the blunt -ended vector and the 
GUS coding sequence were relegated. A plasmid (pPAL3 03) 
containing the Gus coding sequence inverted with respect 
3 0 to the direction of transcription of the CaMV 3 5S 
promoter was identified. 

The binary vector PAL1302 containing the anti-sense 
GUS gene was constructed using the vector pVUlOll 
(obtained from The Plant Breeding Institute, Cambridge, 
35 UK) . pVUlOll contains the hygromycin phosphotransferase 
coding sequence flanked by the CaMV 35S promoter and the 
HQS ter inserted into the polylinker of the Aarobacterium 
binary vector Binl9 described by Bevan, M., Nucl. Acids 
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Res. 1984, 12: 8711-8721. The vector pVUlOll can confer 
both hygromycin and kanamycin resistance to transformed 
plant cells. The insertion of the CaMV35S promoter anti- 
sense. GUS nos ter fragment into pVUlOll was accomplished 
5 in such a way as to inactivate the NPTII gene of this 
vector and was performed a follows. A small SphI - PstI 
restriction fragment containing the right border (RB) , 
the NOS promoter and the beginning of the NPT II coding 
sequence of pVUlOll was first subcloned into the SphI and 

10 PstI sites preceding the CaMV 35S promoter of pPAL303 to 
form pPAL306. Digestion of pPAL306 with SphI and EcoRI 
released a fragment consisting of the RB, the NOS 
promoter, the beginning of the NPT II coding sec[uence and 
the CaMV 35S promoter - anti-sense GUS noster construct. 

15 This fragment was then ligated into the SphI sites of 
pVUlOll by adding to the ligation pGEM-4Z (Promega 
Biotech, Madison, WI, USA) cut with EcoRI and SphI to 
provide a small fragment of polylinker as a bridge 
between the SphI site of pVUlOll and the EcoRI site of 

20 the insert from pPAL»306 respectively. The orientation of 
the insert was verified and a binary vector {PAL13 02) 
possessing a reconstructed RB fragment and a NPT II gene 
inactivated by the CaMV 3 5S promoter - anti- sense GUS 
gene-noster insertion was identified. This vector can 

25 confer only hygromycin resistance to plants and carries 
the anti -sense GUS gene. 

In Figure 2a-d the orientation of the genes contained 
within the four microspore specific clones from Brassica 
napus are from 5' to 3 ' . As shown, the 5' region 

3 0 corresponds to the promoter region and is identified with 
a small arrowhead. The 3' region delineates the end 
point of transcription of the gene. Clones L4 , LIO and 
L19 were used for the isolation of microspore specific 
promoter fragments and for the isolation of microspore 

35 specific coding regions. The non- transcribed regions are 
identified as a single thin line, while the regions of 
the clones that are transcribed are demarcated by a boxed 
area. Within this boxed area the portion of the 
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transcribed DNA that represents the exon regions is 
demarcated by being filled in black while the intron 
sequences are left unfilled. The approximate regions of 
DNA sequenced for clones LIO, LIS and L19 are shown by 
5 underlining. Restriction sites identified are those that 
are relevant to the constructs detailed below. The right 
and left arins of the lambda cloning vectors are not 
shown. 

In Figures 3a-d, the complete DNA sequence of the 

10 clone L4 is shown along with the DNA sequence of the 
portions of the clones LIO, L16 and L19 that are 
identified in Figures 2a-d. In Figure 3a, clone L4 , 
nucleotide 1 in the complete sequence is at the left -most 
EcoRI site while nucleotide 8579 is at the first 

15 nucleotide of the right-most EcoRI site. The start of 
transcription of gene 1 in clone L4 is nucleotide 235. 
The 5/ and 3' intron splice sites are identified in 
boldface type. The start ATG codon is shown as well as 
the stop termination codon. The deduced amino acid 

20 sequence of the proteins encoded for by these genes are 
also shown. The end of transcription for gene 1 is 
approximately nucleotide 1427. The second gene in clone 
L4 is most likely non- functional due to a insertion and 
a deletion that occurs in the region of the promoter and 

25 first exon. This gene was not utilized for constructs. 

The third gene in clone L4 has a transcriptional start at 
position number 6298 in the DNA sequence and 
transcription ends at approximately nucleotide 74 90. The 
ATG start codon, intron splice sites and termination stop 

30 codon are all identified as above. Vectors were 
constructed from this clone by using promoter fragments 
from both genes 1 and 2, as well as promoter fragments 
from genes 1 and 2 that contained the first exon and 
intron sequences and a short portion of the second exon 

35 for each of the genes. The specific promoter fragment 
constructs are detailed below. 

In Figure 3b, the nucleotide sequence of the region 
of clone LIO demarcated in Figure 2b is shown. The start 
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Of transcription is at nucleotide 1. in this sequence 
the ATG start codon is at nucleotides 45-4 7, the first 
exon ends at nucleotide 315 the second exon starts at 
nucleotide 476 and extends to nucleotide 1586. The third 
exon starts at 1673 and extends approximately to 
nucleotide 1989. The precise end of transcription was 
not determined. The deduced amino acid sequence is also 
shown. For some promoter constructs the region of the 
clone 5' to the sequenced portion were used. The 
specific details of the constructs are listed below.. 

In Figure 3c, the nucleotide sequence of clone LI I 
is shown. Clone L16 shows considerable homology to clone 
LlO specifically in the portions of the two clones that 
code for protein sequence. The intron sequences between 
the two clones differ considerably however. Clone L16 
does not contain a 5' promoter region and as such was 
used only as a source of coding sequences for ant i- sense 
RNA constructs. Nucleotide 1 demarcates an EcoRI site 
that occurs in a coding region of the DNA that is 
homologous to the first exon of clone LlO. By homology, 
this coding region extends to nucleotide 124 , where the 
first intron is located. This intron, which is located 
at the same relative position as the first intron of 
clone 10, is longer than the intron in clone LlO and 
extends to nucleotide 688. Nucleotide 689 is the start 
of the second exon and this exon, which shows strong 
homology to the second exon of clone LlO, extends to 
nucleotide 1793. There is a second intron at this point 
and this intron extends to nucleotide 1909. The third 
exon starts at 1910 and extends to approximately 
nucleotide 2210. The deduced amino acid sequence is also 
shown for specific regions of the clone that show 
considerable homology to clone LlO. The precise 
nucleotide where transcription stops has not been 
determined. 

In Figure 3d the nucleotide sequence of the region 
of clone L19 demarcated in Figure 2d is shown. The start 
of transcription is located at position i in the 
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sequence. The ATG start codon is at nucleotides 136-138 
and the first intron starts at nucleotide 1201. This 
intron ends at nucleotide 1338 wherein the second exon 
starts. The end of transcription occurs at approximately 
nucleotide 2074. The deduced amino acid sequence is also 
shown. 

In Figure 4, the DNA sequences of three cDNA clones 
that are homologous to the genes contained in the clone 
L4 are shown. The DNA sequence of these three cDNA 
clones as well as the sequence of the correctly spliced 
transcribed regions of genes Bp4A and Bp4C in the genomic 
clone L4- are aligned, only the nucleotide differences 
within these clones are shown, nucleotides that are 
conserved between the sequences are only shown on the 
upper sequence. The asterisks shown in Figure 4 mark the 
5' end of the cDNA clones of cBp401, cBp4 05 and cBp4 08. 

In Figure 5, the partial nucleotide sequence of a 
cDNA clone that is homologous to the coding region of 
clone Lie is shown. This cDNA clone is approximately 1,3 
Kb in length and has EcoRI sites at the 5' and 3' ends of 
the cDNA sequence that were added via synthetic linkers 
in the cDNA cloning procedure. 

In Figure 6, the nucleotide sequence of the cDNA 
clone that corresponds to the coding region of clone LI 9 
is shown. Identified in this sequence is the EcoRV site 
present at the 5' end of the transcribed region of L19. 
A portion of the poly A tail is shown. Not shown are the 
EcoRI sites that were added as linkers in the cDNA 
cloning procedure; these sites are present adjacent to 
the 5' and 3' ends of the cDNA clone. 

In Figure 7 {a,b,c,d,e) the construction of 6 vectors 
containing promoter and promoter fragments from the clone 
L4 is described. The first step in the construction of 
these vectors was accomplished by first subcloning the 
EcoRI-SstI (nucl .1-2132) fragment containing the first 
gene of clone L4 (235 base pairs of 
promoter/exon/intron/second exon) in the commercially 
available vector pGEM.42 (Promega Biotech, Madison, WI , 
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USA) using the EcoRI-SstI sites of the polylinker of this 
vector. This plasmid was named pPAL04 02. The 2.7Kb 
EcoRI fragment of clone L4 that contains the third gene 
(Bp4C) was then cloned into the EcoRI site of pGEM 4Z, 
5 leading to a plasmid called pPAL0411, The plasmid 
pPAL04 02 was then digested with EcoRI and the 2,7 Kb 
EcoRI fragment from pPAL0411(nucl . 5859-8579) that 
contains the gene number three (Bp4C) from clone L4 was 
added to it . Clones were recovered that contained this 

10 inserted 2.7 Kb EcoRI fragment in both orientations 
relative to the promoter region of the first gene. A 
clone that contained this' third gene fragment in a 
orientation such that the promoter from the third gene 
was opposite to the promoter in the first gene was chosen 

15 and called pPAIj0403. The plasmid pPAL0403 contains the 
entire third gene from clone L4 oriented in such a 
fashion as to have the promoter region immediately 
adjacent to the 23 5 basepair promoter region of the first 
gene in pPAL0403 . This plasmid, pPAL0403 was digested 

20 with Ddel, producing a fragment of approximately 1.9 Kb. 
The Ddel sites are located at nucleotides 303 and 7366. 
Because of the orientation of these fragments, digestion 
with Ddel produces a 1.9 Kb fragment. This 1.9 Kb 
fragment contains a copy of the third gene (Bp4C) 

25 oriented such that the direction of transcription of this 
third gene is from right to left, fused to the 235 base 
pair promoter fragment from the first gene of clone L4 
(Bp4A) which is transcribed from left to right, ending in 
a Ddel site that is located 67 base pairs downstream of 

3 0 the major start site of transcription and precedes that 
ATG start of translation codon by 2 nucleotides. This 
1.9 Kb Ddel fragment was made blunt with Klenow fracpnent 
and cloned into the Xbal site of the polylinker region of 
pGEM 42 previously made blunt ended with Klenow fragment. 

35 The resultant plasmid pPAL0408, was recovered and 
subsequently was digested with Sail and SstI, which 
releases the cloned Ddel fragment bordered by on the left 
hand side, (nucl 7366) Sail and on the right hand side 
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(nucl 303) of this construct and contains a portion of 
the polylinker of pGEM 4Z containing the following unique 
sites: BamHI, Stnal, Kpnl, and SstI restriction enzyme 
sites. This Sail - SstI fragment was cloned into the 
Sail - SstI sites of PALIOOI. PALlOOl is the binary 
vector BinlS (described by Bevan, M. , Nucleic Acids Res., 
1984, 12:8711-8721) to which. has been added the nos ter 
polyadenylation signal as a 260 bp SstI - EcoRI fragment 
isolated from the plasmid pRAJ221 (available from 
Clonetech Laboratories, Palo Alto, CA USA) in the SstI - 
EcoRI sites of the polylinker region of >iinl9. This nos 
ter is identified as a stippled box. The binary 
transformation vector that resulted from the insertion of 
the Sail - SstI fragment of pPAL0408 into PALIOOI was 
15 named PAL1107. The details of the construction are shown 
in Figure 7a. This vector has a copy of the third gene 
oriented such that the direction of transcription of this 
third gene is from right to left, fused to the 235 base 
pair promoter fragment from the first gene of clone L4 
which is transcribed from left to right, followed by a 
polylinker with unique sites for the insertion of DNA 
which consist of: BamHI , Smal, Kpnl and SstI followed by 
the nos ter signal . This vector has the feature in that 
additional 5' non-coding sequences were placed upstream 
2 5 to the 23 5 base pair core promoter on Bp4A, but these 
additional 5' sequences were in i\ opposite orientation. 
The provision of these sequences in this orientation does 
not affect the pollen specif icity of the core 235 base 
pair promoter. 

30 In addition to this vector, similarly structured 

vectors were made which contained essentially the same 
type of gene promoter arrangement but contained the 
intron of the first gene (Bp4A) of clone L4 . Intron 
sequences in plant genes have been shown in some cases to 

35 play a role in gene expression. This intron containing 
vector was constructed by making a deletion series of the 
clone PPAL0402. pPAL0402 was first digested with PstI 
and Smal. Exonuclease III was used to unidirectionally 
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digest the DNA as shown (Figure 7b) . After SI nuclease 
treatment and repair with Klenow, the plasmid was 
relegated and clones that have had different portions of 
the coding regions of gene Bp4A digested out of them were 
5 recovered. Deletion subclones were sequenced. One was 
chosen for vector constructs. This is referred to as 
deletion 23B, . This subclone represented a deletion that 
has most of the second exon of gene Bp4A removed but 
contains the intron splice site and first exon of gene 

10 Bp4A. This subclone contains a portion of the clone L4 
that extends from nucleotide 1 to nucleotide 1166. To 
this subclone was added the 2.7 Kb EcoRI fragment from 
pPAL0411 that contains the third gene of L4 (Bp4C) in 
such an orientation that the direction of transcription 

15 of the third gene is from right to left (as in PAL1107, 
pPAL0408) , fused to the 23 5 base pair promoter region 
from the first gene of clone L4 which is oriented to 
transcribe from left to right followed by the first exon 
of gene 1, the entire intron of gene 1 and 3 3 nucleotides 

20 of the second exon of gene Bp4A from clone L4 . This 
plasmid containing deletion 23B and the 2.7 Kb EcoRI 
fragment containing the third gene fragment was named 
PPAL0406. This plasmid was digested with Hindlll, which 
yields a fragment containing a small portion of the 

25 promoter of the third gene as well as the entire promoter 
of the first gene, first exon, intron and a portion of 
the second exon. This Hindu I fragment was inserted into 
the Hindlll site of PALlOOl, resulting in the vector 
PAL1106 (deletion 23B derived) . This vector has in the 

3 0 following order, A portion of the promoter from the third 
gene in clone L4 , the entire 235 base pair promoter of 
the first gene in clone L4 , followed by the first exon, 
the intron and a portion of the second exon of gene 1 of 
clone L4, followed by a polylinker containing the 

35 following unique cloning sites: Sail, Xbal, BamHI, Smal, 
Kpnl and SstI and the nos ter polyadenylation signal. 
The construct is shown in Figure 7b. 
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Additional constructs with the promoter regions of 
the genes contained in clone L4 were done in order to 
provide a number of suitable vectors that are useful for 
pollen specific expression of gene sequences. The three 
genes within clone L4 (Bp4A, Bp4B, Bp4C) show very near- 
exact DNA homology and this is most apparent between the 
first (Bp4A) and third (Bp4C) gene. The second gene 
(Bp4B) is a homologous copy that has undergone sequence 
changes that have appear to have lead to inactivation. 
The extensive similarity between the first, second and 
third genes in clone L4 is also maintained in the 
promoter region such that out of the first 23 5 
nucleotides of the first and third gene promoter regions 
there are only 5 nucleotides that differ between them. 
Downstream of the TATA box in these two promoters the 
only difference between them is the presence of one 
additional nucleotide at the start of transcription. For 
example, comparison of Promoter 1, Bp4A, partially 

represented as: TATGTTTt AAAA ... with Promoter 

3,Bp4C, partially represented as: TATGTTT AAAA. . . . 

shows that the transcribed region underlined and the 
single nucleotide difference in lower case. However, 
within the sequence of the first gene there is a 
nucleotide change that introduces a Ddel site (nucl . 303) 
in the untranslated 5' leader sequence upstream of the 
ATG start codon that is not present in the untranscribed 
leader sequence of the third gene in clone L4 . Chimeric 
promoter constructs were made which utilized this Ddel 
site in the first gene to combine with sequences from the 
third gene promoter. The region of the first promoter 
used for these constructs consisted of the sequences 
contained between the SnaBI site (nucl 21) near the TATA 
box to the Ddel site located immediately upstream of the 
ATG start codon in the first gene (nucleotide 303 is the 
first nucleotide in the recognition sequence for Ddel) . 
The other region of this chimeric promoter (5' of the 
TATA box) was a fragment extending from the EcoRI site of 
the third promoter (nucleotide 5858) to the SnaBI site 
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near the TATA box (nucleotide 6273). Therefore to 
facilitate construction of these pollen specific vectors, 
the following reconstructions were performed. 

The EcoRI to Ddel fragment that encompasses the 
promoter region of the first gene in clone L4 was 
isolated by first cutting pPAL0402 with Ddel, blunting 
with Klenow, and then cutting with EcoRI. The 235 base 
pair fragment corresponding to this region was cloned 
into the EcoRI - Smal sites of pGEM 42. This plasmid 
(pPAL0422) , was then cut with EcoRI and SnaBI . A DNA 
fragment that container^ the EcoRI to SnaBI portion of the 
promoter for gene 3 in clone L4 was isolated by digesting 
PPAL0411 with EcoRI and SnaBI. This released an 
approximately 415 base pair EcoRI (nucl. 5858) to SnaBI 
(nucl. 6273) fragment that represents most of the 5' 
region of the gene 3 promoter from clone L4 (the SnaBI 
recognition site is 2 base pairs downstream of the TATA 
box) . This EcoRI - SnaBI fragment was used to replace 
the shorter EcoRI - SnaBI fragment removed for the first 
promoter subclone (pPAL0422), reconstructing a promoter 
fragment of approximately 550 base pairs. This plasmid 
is referred to as pPAL0421. This chimeric promoter 
fragment contains 415 base pairs of the promoter of gene 
three in clone L 4, followed by approximately 99 
Nucleotides of the first gene promoter/untranslated 
leader sequence. 

For construction of a pollen specific cassette 
vector, the following plasmids were first constructed. 
The first plasmid constructed contained the nos ter 
polyadenylation signal with a polylinker in front of the 
nos ter signal. This was accomplished by first isolating 
from pRAJ221 the nos ter as a SstI - EcoRI fragment and 
this fragment was cloned in pGEM 4 2 using the SstI and 
EcoRI sites in the polylinker. This subcloned is 
referred to as pPALOOl. To pPALOOl, a fragment coding, 
for neomycin phosphotransferase (NPT II) derived from the 
plasmid pRAJ162 was added to it in the anti-sense 
orientation as follows: The plasmid pRAJ162 contains the 
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NPT II gene from the trahsposon TN5 inserted as a Sail 
fragment and bounded by a polylinker in the plasmid pUC-9 
(which was obtained from the Plant Breeding Institute, 
Cambridge, UK) . pRAJ162 was digested with Hindi! I and 
Smal. The DNA fragment containing the NPT II gene was 
isolated by elution from an agarose gel. pPALOOl was 
digested with Hindi II and Smal and the NPT II gene 
fragment was inserted. The resultant plasmid was called 
PPAL002 and had such orientation of restriction sites and 
the NPT II gene and nos ter as follows: HINDIII, PstI, 
Sail, 3' end NPT II coding sequence 5' end. Sail, BatnHl; 
Smal, Kpnl, SstI, nos ter . EcoRI . pPAL002 was cut with 
Hindlll and the site made blunt ended by the use of 
Klenow fragment. pPAL0421 was digested with Hindi and 
pVUII, both of which leave blunt ends, and the promoter 
fragment was ligated into Hindi I I cut blunt ended 
PPAL002. Plasmids were obtained that contained the 
promoter in both orientations relative to the nog ter 
signal. One plasmid was chosen with the proper 
orientation (5' promoter/anti- sense NPT I I / nos ter ) and 
was named pPAL0419. pPAI,0419 has the following DNA 
fragments: A small (approx. 130 bp) of pGEM 4Z that 
contains the SP6 promoter, the 550 base pair chimeric 
promoter, the NPT II gene in the anti-sense orientation 
relative to the promoter, followed by the nos ter 
polyadenylation signal. This entire promoter/NPT I I / nos 
£er construct is excisable by EcoRI. pPAL04l9 was 
digested with EcoRI, and the promoter NPT II nog £^ 
structure was cloned into Binl9 using the single EcoRI 
site in the polylinker of Binl9 . The resultant 
transformation vector was named PAL1419. In addition to 
the anti-sense NPT II gene, the vector contains a 
constitutive NPT II gene under the control of the nos 
promoter. This vector therefore confers resistance to 
kanamycin in all cell types with the exception of pollen 
cells where the gene expression from the constitutive 
promoter is inhibited by the anti-sense RNA produced from 
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the promoter/NPT Il/nos ter construct contained in 
PAL1419. 

In order to provide promoter sequences that could be 
utilized with additional gene constructs, the plasmid 
5 pPAL0419 was digested with Sail. This digest removes the 
NPT II coding region and this Sail digested pPAL0149 was 
relegated giving rise to pPAL0420. pPAL0420 represents 
the promoter essential for pollen development and/or 
function followed by a polylinker for insertion of genes 

10 that has the following unique sites: Hindi, PstI, Sail, 
BamHI, Smal, Kpnl, SstI, followed by the nos ter 
po 1 y adeny 1 a t i o n signal. The entire 

promoter/polylinker/nos ter construct can be conveniently 
excised as a single EcoRI fragment. The details of this 

15 construct is shown in Figure 7c. 

For additional promoters essential for pollen 
development and/or function constructs, the following 
approach was used. The intact L4 clone in the lambda 
cloning vector was digested to completion with the 

20 restriction enzymes SstI and Hhal . The resultant 
fragments were separated by gel electrophoresis and a 
2.65 Kb fragment that contains the promoter/first 
exon/ intron/partial second exon region of gene three in 
clone L4 and corresponds to nucleotides 4565 to 7216 in 

25 the sequence of clone L4 was isolated. This fragment was 
made blunt ended with Klenow and cloned into the binary 
transformation vector PALlOOl previously described. 
PALlOOl was first cut with Hindlll and made blunt ended 
with Klenow, Clones containing this fragment 

30 (promoter/first exon/intron/partial second exon) were 
recovered. A clone was chosen that contained this 
fragment in the proper orientation such that the 
direction of transcription was towards the nos ter in 
PALlOOl. This* vector was named PAL1421. This vector 

35 contains approximately 1 . 9kb of upstream promoter region 
from the gene 3 in clone L4 followed by the first exon, 
the complete intron and 15 bases of the second exon of 
gene three followed by a polylinker containing the 
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following unique sites: Sail, Xbal, BamHI , Smal, Kpnl, 
SstI, and finally the nos ter polyadenylation signal. A 
variant of this vector was constructed by digesting 
PAL1421 with EcoRI and isolating the fragment from this 
clone that contains the promoter polyl inker nos ter 
sequences but contained less of the upstream region of 
the promoter. This fragment was re-cloned into PAL1009. 
PAL1009 is a Binl9 derived vector from which most of the 
polylinker has been removed. This vector was constructed 
by digesting Binl9 with Hindlll and SstI, making these 
sites blunt ended with Klenow and relegating such that e 
vector was recovered that contained a single unique EcoRI 
site for the insertion of fragments. PAL1009 was 
digested with EcoRI and the EcoRI fragment from PAL1421 
that contains a shorter promoter/exon/intron/second 
exon/polylinker/ nos ter structure was added to it. This 
gave rise to the vector PAL1422, a vector that is 
essentially the same as PAL.1421 with the exception that 
there is less 5' promoter region. It should be noted 
that both PAL1421 and PAL1422 contain the intron from the 
third gene. For constructs which the presence of the 
intron may not be desired, intron sequences were removed 
from PAL1421 by first digesting PAL1421 with EcoRI and 
replacing the promo t e r / e xon / i n t ron / s e cond 
exon/polylinker/noster structure with the 
promoter/polyl inker /nos tsr structure from pPAL0420 using 
EcoRI such that a longer 5' promoter region is 
reconstructed in the binary transformation vector. The 
resultant vector was named PAL1423 . The outline of this 
construction is shown in Figure 7d. 

In Figure 7e, a schematic diagram of the relationship 
of the above described vectors is presented. It should 
be noted that the vectors outlined in this Figure fall 
into three categories: 1) vectors which contain 5' 
upstream promoter regions that are substantially derived, 
from the upstream region of the gene Bp4C (pPAL0420, 
PAL1420, PAL1423), 2) promoter constructs that contain 5' 
upstream promoter regions and xntron sequences from the 
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gene Bp4C (PAL1422, PAL1421) and, 3) promoters which 
contain a chimeric 5' upstream region in which a portion 
of the 5' DNA sequence is inverted relative to the 
arrangement which appears in the genomic clone and uses 
5 the promoter fragment of Bp4A as a core promoter 
structure (PAL1107, PAL1106) . It should be noted that 
the functioning of each of these constructs can vary from 
plant species to plant species and it may be desirable to 
test a number of these promoter constructs when carrying 

10 out certain aspects of this invention. 

The construction of pollen specific vectors that 
utilize the promoter regions of clones LIO and L19 was 
conducted as follows. The construction of the pollen 
specific vectors depicted in Figure 8 utilizes promoter 

15 regions from clone LIO, The start of transcription of 
clone LIO is located at nucleotide 1. The ATG start 
codon is located at nucleotides 45-47. The promoter 
region of this clone was excised by first subcloning the 
EcoRI - Xbal fragment of the clone that encompasses the 

20 entire promoter region and a portion of the first exon 
(the Xbal site is nucleotide 358 in the DNA sequence) . 
This subclone (pPALlOEX) was then digested with Hindi 
and Ndel. The Ndel site is located immediately 
downstream of the ATG start codon at nucleotide 6 0 and 

25 the Hindi site is located at nucleotide number -399. 

The digestion with these two enzymes releases a DNA 
fragment of 4 59 nucleotides which contains 62 nucleotides 
of untranslated transcribed leader sequence, and 3 97 
nucleotides of 5' promoter region. The Ndel site in this 

3 0 fragment was made blunt ended by the use of Klenow, and 
this fragment was subcloned into the Hindi site of the 
polylinker of pGEM 4Z. Clones were recovered in both 
orientations and the clone that contained the fragment in 
the orientation: Hindlll, SphI , Pstl. Hindi, promoter- 

35 62 base pair leader fragment (Ndel blunt/Hindi, does not 
cut with either Hxncll or Ndel) Xbal, BamHI, Smal, Kpnl, 
SstI, EcoRI was chosen and named pPALlG20. To add 
additional upstream regions, the Hindi - Hindi fragment 
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that is approximately 1 Kb in length and is immediately 
upstream of the Hindi site at position 3 91 in the DNA 
sequence was isolated from pPALlOEX by digestion with 
Hindi and gel elution of this fragment. This Hindi 
fragment was cloned into the Smal site of pGEM 4Z. 
Clones which contained the fragment in both orientations 
were recovered and a clone that contained the fragment in 
the following orientation was chosen: Hindlll, Sphi, 
PstI, Hindi, Sail, Xbal, BamHI, the Hindi fragment in 
the same orientation as in the genomic clone, that being 
right to left , 5' -3 ' (as ^ Hindl/Smal insertion which 
does nut cut with either enzyme), Kpnl, SstI, EcoRI . 
This subclone (pPALlOHc) was digested with Knp I, made 
blunt end by the use of Klenow, then digested with EcoRI. 
To this cut subclone was added the promoter/untranslated 
leader sequence of pPAL1020 by digesting pPAL1020 with 
Hindi and EcoRI, and adding this promoter fragment to 
the cut pPALlOHc. The resultant subclone contained a 
reconstructed promoter region of clone LIO differing from 
the intact region by only the filled in Kpnl site used 
for the joining of the two promoter fragments. This 
construct was named pPAL1021. This vector contains in 
the following order: Hindlll, PstI, SphI, Hindi, Sail, 
Xbal, BamHI, the approximately 1 Kb Hindi fragment 
joined to the Hindi - Ndel promoter fragment followed by 
Xbal, BamHI, Smal, Kpnl, SstI, and EcoRI. This subclone 
allows for the convenient removal of the promoter region 
of clone LIO such that the promoter can be easily used in 
cassette transformation vectors. The outline of this 
construction is shown in Figure 8. The promoter region 
of pPAL1021 was used for the construction of a pollen 
specific cassette transformation vector by carrying out 
the following constructs: The plasmid pPAL1021 was 
digested with Ncol and Pstl. The plasmid was treated 
with Klenow and relegated. This procedure effectively 
removed the portion of the polylinker that was 5' to the 
promoter in pPAL1021. This plasmid was then digested 
with Hindlll and SstI. and cloned into the Hmdlll and 
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SstI sites of PALlOOl, giving rise to PAL1121. PAL1121 
has in the following order: the promoter essential for 
pollen development and/or function of clone LIO 
(approximately 1.1-1,2 Kb) , followed by a polylinker with 
the following unique sites: Xbal, BamHI , Smal, Kpnl, 
SstI, followed by the nos ter. The construction of this 
is outlined . in Figure 8. The LIO promoter isolated here 
is also referred to as promoter BplO. Also see Albani, 
D., Sardana, R., Robert, L.S., Altosaar, I., Arnison, 
P.G., and Fabijanski, S.F., A Br^ssica. napus gene family 
which shows sequence similarity ^to ascorbate oxidase is 
expressed in developing pollen. Molecular 
characterization and analysis of promoter activity in 
transgenic tobacco plants. Plant J. 2: 331-342 (1992). 

The promoter region of the clone LI 9 was also used 
for construction of pollen specific vectors. The 
construction of these vectors is as shown in Figure 9, 
Clone L19 has a single pollen specific gene contained 
with it. The start of transcription in this gene is 
located at position 1 in the DNA sequence. The ATG start 
codon is located at nucleotide position 136-138. The 
only intron is located at nucleotides 1202-13 87, the stop 
translation codon is located at nucleotides 2024-2026. 
The end of transcription is located at approximately 
nucleotide 2074. The entire EcoRI fragment of this clone 
was subcloned into PGEM 4Z by using the EcoRI site 
located in the polylinker. The resultant clone was named 
PPAL1901. The promoter region of this clone was excised 
as a single fragment by digesting pPAL1901 with BamHI and 
EcoRV, and a 2177 basepair fragment corresponding to the 
promoter region was isolated. This fragment covers from 
nucleotide -2200 (BamHI) to nucleotide 156 (EcoRV) . This 
promoter fragment contains over 2Kb of 5' upstream region 
of the promoter in clone L19, 134 basepairs of 5' The 
promoter region of the clone LI 9 was also used for 
construction of pollen specific vectors . The 
construction of these vectors is as shown in Figure 9. 
Clone L19 has a single pollen specific gene contained 
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with it. The start of transcription in this gene is 
located at position 1 in the DNA sequence. The ATG start 
codon is located at nucleotide position 136-138. The 
only intron is located at nucleotides 1202-1387, the stop 
translation codon is located at nucleotides 2024-2026. 
The end of transcription is located at approximately 
nucleotide 2074. The entire EcoRI fragment of this clone 
was subcloned into PGEM 4Z by using the EcoRI site 
located in the polylinker. The resultant clone was named 
pPALlSOl. The promoter region of this clone was excised 
as a single fragment by digesting pPAL19t^a with BamHI and 
EcoRV, and a 2177 basepair fragment corresponding to the 
promoter region was isolated. This fragment covers from 
nucleotide -2200 (BamHI) to nucleotide 156 (EcoRV) . This 
promoter fragment contains over 2Kb of 5' upstream region 
of the promoter in clone L19, 134 basepairs of 5' 
untranslated leader sequence and 2 0 basepairs of 
translated sequence. The BamHI site in this fragment was 
made blunt ended by the use of Klenow and cloned into 
PALIOOI. This step was accomplished by cutting PALlOOl 
with Hindlll, making this site blunt ended by the use of 
Klenow and inserting the blunt ended BamHI - EcoRV 
fragment in such an orientation that the promoter was 
oriented 5' to 3 ' with respect to the polylinker/nos ter 
polyadenylation signal. This vector was named PAL1920 
and contained within it in the following order: The 
promoter from clone L19 containing 134 base pairs of 5' 
untranslated leader sequence, 20 base pairs of translated 
sequence fused to a polylinker containing a former 
Hindlll site inactivated by blunt ending, SphI, PstI, 
Sail, Hindi, Xbal, BamHI , Smal, Kpnl, SstI (the unique 
cloning sites are underlined) , the nos ter 
polyadenylation signal. This vector is convenient for 
the insertion of DNA sequences to be transcribed in 
pollen cells. The outline of this construct is shown in 
Figure 9. A plasmid for future construct downstream of 
promoter TA39 was constructed as follows: TA39 promoter 
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was cut out and isolated from pPHP14 81 as a PstI 
fragment . 

It was ligated into pPHP585 (i.e., BlueScript SK+) 
cut with PstI and NotI, together with the Notl-PStI 
5 fragment from pPHP1428, and screened for the construct 
containing both the PStI and Pstl-NotI fragments. The 
screening employed in addition to Pstl-NotI, PvuII, and 
Hindlll digests. Resulting plasmid was pPHP1428. That 
portion of pPHPl4 94 incorporated into the lamS construct 

10 PPHP5840 (fig. 25) and the lamH construct pPHP5841 (fig. 

26) w.-s the Hindlll -BamHI fragment containing th^ 
TA3 9(883) promoter and 6'. 

In Figure 10, the restriction map of a BrsLSsica napus 
genomic clone (HPlOl) that contains a constitutively 

15 expressed gene is shown and the fragment of this clone 
that contains a 5' promoter region along with a portion 
of transcribed sequence is identified. This fragment was 
isolated by first cloning the small 2 . 5 kb EcoRI fragment 
in pGEM 4Z, and obtaining a subclone that had- this 

20 fragment inserted in the indicated orientation relative 
to the polylinker of pGEM 4Z. This clone was called 
pPALOlOl and was deposited January 26, 1990 with the 
American Type Culture Collection (ATCC) , 12301 Parklawn 
Drive, Rockville, MD, 20852, USA as pPALOlOl/E^ coli 

25 strain DH5 alpha xinder accession number ATCC 68210. This 
coli strain grows on standard coli media (LB) with 
100 micrograms per ml of ampicillin. This subclone, 
pPALOlOl, was then digested with- EcoRI, treated with 
Klenow fragment, then digested with BamHI, which releases 

3 0 the promoter/ transcribed region indicated. This fragment 
was cloned into Hindi - BamHI cut pGEM 4Z, resulting in 
the subclone pPALHPlOl. The subclone can be used for the 
isolation of promoter sequences in vector constructs that 
utilize a constitutive promoter to synthesize pollen 

35 specific anti-sense RNA. 

In Figure 11 a schematic representation of producing 
a polylysine coding gene is shown. In this construct, 
the cloning vector pGEM 42 was used as a recipient of a 
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synthetic oligonucleotide containing a ATG start codon 
and to this was added a polynucleotide consisting of 
solely one nucleotide. This gene therefore, depending on 
the nucleotide used is a gene that has predominantly one 
codon, and codes for a protein that is composed of a 
polyamino acid. The construct was carried out in the 
following fashion: To provide a ATG start codon in a 
favorable initiation context, a synthetic oligonucleotide 
was constructed and inserted into pGEM 42 between the 
Hindlll and SphI sites. This nucleotide had the 
sequence : 5 ' -ACGTGGATCCAAGATGACATG- 3 ' . The resultant 
subclone therefore had the DNA sequence (restriction site 
for a introduced BamHI site is underlined, the ATG start 
codon in bold) AACGT GGATCC AAG ATG ACA TGC GCA ACA TGG 
at the 5' region such that there was a ATG start codon in 
a favorable initiation context and a BamHI site upstream 
of this site for excision of the coding sequence. This 
subclone was digested with PstI, divided into two 
aliquots, one was tailed with T residues using terminal 
transferase and TTP, one was tailed with A residues using 
terminal transferase and dATP . The two tailed plasmids 
were mixed together, extracted with phenol -chloroform, 
ethanol precipitated and resuspended. The plasmid 
mixture was cut with SstI, and relegated. Clones that 
were recovered were either clones that contained on the 
coding strand poly A or poly T. Clones were cut with 
BamHI, the size of the insert determined by gel 
electrophoresis, and sequenced to determine if the clone 
coded for poly- lysine (poly A) or poly-phenylalanine 
(poly T, poly U in the mRNA) . A clone that coded for 
poly- lysine and was approximately 3 00 nucleotides was 
chosen and called pPAL pLys . This clone was cut with 
Xbal, blunt ended with Klenow and a universal translation 
terminator (available from Pharmacia PL Biochemicals , 
Montreal, Canada) was added to it, completing the 
construction of the gene. This construction is shown in 
Figure 10. 
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In Figure 12 a scheme for the production of an anti- 
sense gene specific to the intron region of clone L19 and 
the production of an intronless version of the number L19 
gene is shown. For the first step in this, a restriction 
fragment from clone L19 was isolated that is 
substantially all intron. This fragment was isolated by 
using the restriction enzyme Ddel which cuts at a number 
of sites in the genomic clone, but the sites at 
nucleotides 1186 and 1348 give rise to a restriction 
fragment that is substantially intron sequences, having 
only approximately 16 ricleotides at the 5' side of the 
intron that are included in the final transcript, and 10 
nucleotides at the 3' side of the intron that are 
included in the final transcript. This Ddel fragment was 
isolated by gel electrophoresis, made blunt ended and 
cloned into Smal cut pGEM 42. Clones in both 
orientations were obtained, and the clone that contained 
the intron region in the orientation: Hindlll, SphI , 
PstI, Sail, Xbal, BamHI (former Smal) 3' end of the 
intron, intron sequences, 5' end of the intron, (former 
Smal), Kpnl, .SstI,.EcoRI was chosen. This clone was 
named pPAL1914 and was digested with BamHI and SstI, and 
inserted into PAL1920 previously cut with BamHI and SstI, 
creating the vector PAL1954. 

To create a specific restorer vector, the cDNA clone 
19 cDNA (Figure 6) was fused to the promoter region as 
follows: The cDNA clone was digested with EcoRV and Smal 
as shown in Figure ll. To this cut vector was added the 
EcoRV - Smal fragment from clone L19. Clones were 
recovered that contained the reconstructed 5' region of 
the promoter and coding sequence, but carried coding 
sequences that lacked the intron, most of the coding 
regions being derived from the cDNA clone. This clone 
was digested with EcoRI, made blunt ended with Klenow, 
then digested with Smal. A DNA fragment that represents 
the entire coding region and a portion of the promoter 
region was isolated and cloned into PAL1920 cut with 
Smal, leading to a reconstruction of the promoter region 
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and coding region lacking an intron. This vector was 
called PAL1955. 

In Figure 13, the production of clones containing the 
coding sequences of a protein functionally related to the 
5 ricin A chain protein isolated from Ricinus communis is 
detailed. This was accomplished by first isolating a 
genomic clone homologous to ricin from a genomic library 
of Ricinus zanzabarenis DNA constructed in the vector 
lambda gtlO using standard protocols. The library was 

10 screened with a DNA probe that corresponds to the N- 
terminal leader sequence of the ricin gene. The probe* 
sequence was obtained from the published sequence of a 
Ricinus communis ricin gene (Hailing, et al . , Nucl. Acids 
Res 13:8019-33, 1985) . A genomic clone was isolated that 

15 contained the leader sequence and a portion of the A 
chain was isolated and called RIC IB. This clone 
contained the promoter region, the 5' untranslated 
region, the N-terminal leader sequence and coding region 
that extended to amino acid 191 (lie) in the published 

20 sequence (Hailing, et al . , Nucl. Acids Res 13:8019-33, 
1985) . The difference between the published sequence and 
RIC IB was that the published nucleotide sequence at the 
region of lie 191 was: (lie 191 underlined) ACG AGA ATT 
CGG which codes for the amino acids : The Arg lie Arg 

25 while in RIC IB the nucleotide sequence is ACG AGA ATT 
CGG, which codes for the same amino acids (The Arg lie 
Arg) , the only difference being the last Arg is coded for 
by CGG in RIC IB while in the published sequence it is 
coded for by AGG. This single nucleotide substitution 

30 has the effect of introducing a EcoRI site at lie 191. 

The clone RIC IB therefore was missing the amino acids 
present after lie 191 since the clone was isolated as a 
single EcoRI fragment. This truncated version of ricin 
was used for construction of ricin A chain N-terminal 

35 deletions as follows: The clone RIC IB was digested with 
Hindi and EcoRI. The fragment was cloned into EcoRI- 
Smal cut pGEM 42. The resulting clone. pPALRlB was 
digested with PstI and BamHI . This cut clone was 
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digested with Exo III nuclease, treated with SI nuclease 
and Klenow fragment and then relegated. Subclones, were 
obtained that had various portions of the 5' region 
deleted, and some of these deletions were sequenced. One 
deletion, named pPAL-Ridefin had the majority of the N- 
terminal leader DNA sequence removed and had Hindlll and 
SphI sites 5' to this region such that the DNA sequence 

was as follows: AAGCTT GCATGC GCA ACA TGG wherein the 

first six nucleotides code for a Hindlll site found in 
pGEM 42, the next six nucleotides codes for the SphI site 
in pGEM 4Z and the following three triplets code for 

amino acids -20, -19, -18 (Ala The Trp...) in the 

published sequence (Hailing, ' et al , , Nucl , Acids Res 
13:8019-33, 1985). This subclone therefore had a 
deletion that removed the first 15 amino acids of the A 
chain leader sequences of ricin. To provide a ATG start 
codon in a favorable initiation context, a synthetic 
oligonucleotide was constructed and inserted into the 
subclone between the Hindlll and SphI sites. This 
nucleotide had the sequence : 5' -ACGTGGATCCAAGATGACATG-3 ' . 
The reconstructed clone (pPAL Rictr) therefore had the 
DNA sequence (restriction site for a introduced BamHI 
site is underlined, the ATG start codon in bold) AACGT 
GGATCC AAG ATG ACA TGC GCA ACA TGG at the 5' region such 
that there was a ATG start codon in a favorable 
initiation context and a BamHI site for excision of the 
coding sequence. The clone pPAL Rictr was digested with 
EcoRI, end filled with Klenow, and dephosphorylated with 
alkaline phosphatase. To the vector was added the 
universal translational teirminator purchased from 
Pharmacia-PL biochemicals (Montreal, Canada) to provide 
a termination codon. The coding region from this clone 
was isolated by digestion with BamHI and pVUII, releasing 
the coding region and a small ' portion of pGEM 4Z, and 
this fragment can be cloned into the BamHI and Smal sites 
of transformation vectors. This DNA fragment codes for 
a version of a ricm A chain m which a C-terminal 
portion has been deleted. It should be noted that a 
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number of C- terminal and N- terminal deletions of the 
ricin A chain have been tested in vitro for toxicity,, and 
these reports have concluded that the N-terminal half of 
the ricin A chain is sufficient for cytotoxicity in vitro 
5 (for example see: Sudan et al . , Nucl . Acids Res, 
17:1717-32, 1989). In order to obtain a complete coding 
region for the ricin A chain, synthetic version of the 
rest of the A chain was synthesized using the published 
sequence. This synthetic portion of the gene extended 

10 the DNA sequence to nucleotide 1182 in the published 
sequence and had a EcoRI sites at both ends that allowed 
for the joining of this fragment to the EcoRI site at the 
amino acids numbers 190-192. This reconstructed version 
of the gene also had a BamHI site after the stop codon 

15 such that the DNA sequence of the gene at the 3' end was 
as follows: (nucl. 1181 is marked *) CCT CCA* TAA GGATCC 
GAATTC coding for amino acids: Pro Pro stop, followed by 
BamHI and EcoRI, the EcoRI site being used for the 
insertion of the synthetic portion of the gene, the BamHI 

2 0 being used for excision of the complete ricin A chain 
sequence since a synthetic BamHI site is at the 5' end of 
the coding region also. The clone was called pPAL 
Riccom. 

In Figure 14 the results of anti- sense RNA inhibition 
25 of Beta-glucuronidase gene activity is shown in a 
histogram of GUS gene activity found in transgenic plants 
that contained a sense GUS gene and were retransf ormed 
with a vector containing an anti-sense GUS gene. GUS 
activity levels were expressed as a percentage of the GUS 
30 activity found in the original GUS+ plant TTR-48. Tissue 
was obtained from young (Y) , medium (M) and old (O) 
leaves and GUS activity was assayed 
spectrophotometrically . Lanes 1-5, samples from the 
retransf ormed plants TTR-1 to TTR-5 (TTR-4 8/PAL1302 ) ; 
35 lane 6, samples from plant TTR-48; lane 7, samples from 
TTR-88 {TTR-48/pVUlOll) and lane 8, samples from 
uncransf ormed tobacco. 
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Figure 20 illustrates hybrid seed production using 
a variation of the binary cryptotoxicity method and a 
herbicide resistance gene. Genes 1 and 2 can be the lamS 
and lamH genes. The herbicide resistance gene can be any 
resistance gene against any herbicide. Parent A is 
produced with the three genes Gl , G2 , and H being linked 
by virtue of introduction on one recombinant DNA 
molecule, via transformation of a plant cell. it is 
maintained by crossing with a non- trans formed isogenic 
strain and spraying for herbicide resistance. Parent B 
is homozygous for a male sterility restoration gene su'-.h 
as ant i- sense for gene 1 and/or gene 2 and for a linked 
herbicide resistance gene. The resulting hybrid seed is 
essentially 100% herbicide resistant and restored in 
respect to male fertility. 

Figure 21 illustrates hybrid seed production with 
parent A being produced and maintained as shown in 
Figure 20. Parent B is homozygous to herbicide 
resistance. The hybrid is essentially 100% herbicide 
resistant . 

Figure 22a illustrates the use of the binary 
cryptotoxicity method for production of a male sterile, 
herbicide resistant parent A. Gene 1 and gene 2 come 
from different transformed lines, each homozygous for the 
recombinant DNA molecule, with either Gl, or G2, or both, 
being linked to a herbicide resistance gene. The 
resulting parent A is essentially 100% male sterile with 
Gl and G2 on different chromosomes. Parent A is 
herbicide resistant. 

Figure 22b illustrates the method used to maintain 
parent A shown in Figure 22a . 

Figure 22c illustrates the crossing of parent A shown 
in Figure 22a with parent B. Parent B is the result of 
a transformation where the restorer gene ( ant i sense 
sequences, or lexA, for example) and the herbicide 
resistance gene are linked on one recombinant DNA 
molecule. The result of crossing parent A and parent B 
is essentially a hybrxd chat is restored in regard to 



wo 96/40950 



- 98 - 



PCT/US96/08692 



male fertility and is herbicide resistant. If 
restoration to male fertility is not desired, parent B is 
transformed with a recombinant DNA molecule containing 
the herbicide resistance gene, but not the gene(s) 
5 responsible for restoring fertility. 

Example 1 

In this example, ant i- sense RNA was used specifically 
for the inhibition of gene activity in plants. A tobacco 
plant expressing the Beta -glucuronidase gene under the 

10 control of the CaMV 35S promoter was produced by 
transformation of a non- transformed control tobacco 
cultivar, tabaccum, cv. Delgold. To accomplish this, 
tobacco leaves less than 8 inches in length were surface 
sterilized by exposure to ethanol for 5-6 seconds, then 

15 subsequent exposure to 1% sodium hypochlorite for a few 
minutes, usually 5-10 minutes, or until the cut edge of 
the petiole turned white, then rinsed several times in 
sterile distilled water. Leaf segments of approximately 
0,5 to 1,0 square centimeters were cocultured for two. 

2 0 days with Acrrobacterium tumef aciens GV 3101 carrying the 

Ti plasmid pMP90 to provide vir functions in trans 
(described by Koncz, C. and Schell J., 1986, Mol, Gen. 
Genet. 204:383-396) carrying the binary vector pBI121.1 
on shoot inducing media. This vector is a derivative of 
25 Binl9 which contains the GUS gene driven by the CaMV 35S 
promoter and terminated by the nos ter and is available 
from Clonetech Laboratories, Palo Alto, CA. , USA. 
Transformed tobacco cells were selected on a shoot - 
inducing medium containing 0.8% agar, MS salts, B5 

3 0 vitamins, 3% sucrose, 1 mg per L of benzyladenine , 0 . 1 mg 

per L of alpha naphthalene acetic acid, (NAA) 300 ug/ml 
kanamycin and 500 ug/ml carbenicillin (essentially as 
described by Horsch et al . , 1985, Science, 227:1229-31) . 
Regenerated shoots were then transferred to a root- 
35 inducing medium consisting of B5 medium with 2% sucrose, 
500 ug/ml carbenicillin and 0.5 mg/L each of NAA and 
indoleacetic acid (lAA) . Following selection on 
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kanamycin, a tobacco transformant which displayed 
relatively high constitutive levels of GUS activity and 
contained a single unrearranged insertion of the 3 5S CaMV 
promoter-GUS-noster construct was selected. This plant 

5 (TTR-48, GUS+) was then re- transformed with a binary 
vector PAL1302 which contains an anti- sense GUS gene, the 
construction of which is described in Figure 1, In 
experiments involving the retransf ormation of the TTR-4 8 
tobacco plant with PAL13 02, the shoot -inducing medium 

0 contained 2 0 ug/ml hygromycin and 3 00 ug/ml kanamycin to 
insure the selection of plants containing both the sense 
and anti-sense GUS constructs. Transf ormants were grown 
to maturity and self -pollinated in the greenhouse. 

The leaves of tobacco plants resulting from the 

5 retransformation of TTR-48 {GUS+) with the anti-sense GUS 
vector PAL13 02 were assayed for GUS activity. The GUS 
activity in leaf extracts was assayed 
spectrophotometrically . Approximately 0.5 g of leaf 
tissue were ground with a Polytron in 2 ml of GUS 

0 extraction buffer (50 mM NaP04 pH 7.0, 1 mM EDTA, 0.1% 
Triton X-100, 10 mM B-mercaptoethanol , 1 mM p-nitrophenyl 
glucuronide, 100 ug/ml bovine serum albumin, 0.02% sodium 
azide) inciibated at 3 0**C for 6 h and the reaction stopped 
with the addition of 0,4 ml of 2 . 5M 2 -amino-2 -methyl -1, 

5 3 -propanediol . The amount of p-nitrophenol produced was 
calculated by measuring the absorbance at 415 nm. A 
stopped reaction containing an identical amount of leaf 
extract was used as a blank. Relative enzyme activities 
in the extracts were calculated and expressed as 

0 nanomoles of p-nitrophenol produced per mg of protein per 
minute- Ten plants were screene_d and they all displayed 
a large reduction in GUS activity levels relative to 
those observed in TTR-48. Five plants (TTR-1 to TTR-5) 
were chosen for a detailed analysis of the effects of 

5 anti-sense RNA inhibition of GUS gene activity. The 
retransf ormed tobacco plants TTR-1 to TTR-5 (Figure 14, 
lanes 1-5) all showed a considerable to complete (lane 4) 
reduction in GUS activity regardless of the developmental 
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Stage of the leaves examined. Comparison of the highest 
level of GUS activity observed in the original GUS+ plant 
TTR-48 (Figure 14, lane 6) with the highest level found 
in any of the plants retransf ormed with PAL1302 (lanes 1- 
5) shows that the reduction in GUS activity was at least 
90%. The levels of GUS activity found in the control 
plant TTR-88 (Figure 14, lane 7) were similar to those of 
the original GUS+ plant (lane 6) indicating that the 
retransf ormation and regeneration process undergone by 
TTR-48 were not responsible for the decline in GUS 
activity' observed in" TTR-1 to TTR-S, . ' Western "blot 
analysis of total protein extracts obtained from young 
leaves were performed. For extraction of leaf proteins, 
approximately 100 mg of tissue were ground in a 1 . 5 ml 
Eppendorf tube containing 0.7 ml of GUS extraction buffer 
described above. An equal volume of SDS PAGE 2X sample 
loading buffer (1.3 M Tris-Cl pH 6.8, 2% B- 
mercaptoethanol, 50% glycerol, 5 mM EDTA, 0.1% 
bromophenol blue) was added and the samples incubated at 
room temperature for 15 min. The extract was centrifuged 
at 12,000 rpm for 2 min and the supernatant was frozen at 
-80**C until use. Proteins were resolved in 10% SDS PAGE 
gels and immediately transferred electrophoretically onto 
a nitrocellulose filter. The GUS protein was then 
detected in the gels using the rabbit anti-B- 
glucuronidase antibody obtained from Clonetech 
Laboratories and an anti -rabbit IgG alkaline phosphatase 
conjugate kit (Promega Biotech, Madison WI, USA) 
according to the manufacturers instructions. Equal 
amounts of protein were separated by sodium dodecyl 
sulfate polyacrylamide gel electrophoresis (SDS PAGE) , 
transferred to nitrocellulose and cross-related with 
antibodies specific to the GUS protein. The amount of 
GUS protein detected on the Western blot correlated well 
with the GUS activity found in the leaves of all the 
tobacco plants examined whether these displayed high 
levels of activity such as TTR-4 8 or no discernible 
activity as in TTR-3 (Figure 14, lane 3) . The reduction 
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in GUS activity in TTR-1 to TTR-5 can therefore be 
directly attributed to lower quantities of GUS enzyme 
within these plants. Southern blot analyses were 
performed to confirm that the sense GUS gene was still 
present and intact in TTR-1 to TTR-5 and to verify that 
the anti- sense GUS construct had correctly integrated in 
the DNA of these plants. - It was found that the 
arrangement of the original sense GUS gene in TTR-4 8 was 
unaffected by the transformation with the anti-sense gene 
containing vector, and it was also further ascertained^ 
that pmongst the plants selected for this analysis; 
between 1-3 copies of the anti -sense gene were inserted 
into the genome of the plant which contained the sense 
gene. It was also determined that a single inserted 
anti-sense gene could lead to a near total reduction in 
sense gene activity. Northern blot analyses were 
conducted on total leaf RNA to determine if the reduction 
in the amounts of GUS enzyme observed in TTR-l to TTR-5 
correlated with their levels of GUS mRNA. Total RNA was 
prepared from tobacco leaves by grinding 0 . 5 g of tissue 
in 2 ml of extraction buffer (6 M guanidine-HCl , 0.1 M 
NaAc pH 6.0, 1.0% b-mercaptoethanol ) for 10-30 sec using 
a Polytron. The mixture was then centrifuged at 5,000 
rpm for 3 min, the supernatant layered on an equal volume 
of 5.7 M CsCl in 0.1 M NaCl , 10 mM Tris-Cl pH 7.5, ImM 
EDTA and centrifuged again at 35, 000 rpm for 16 h at 
15°C. The resulting RNA pellet was resuspended in 0.1 M 
NaAc pH 6.0, 0.1% SDS and extracted with an equal volume 
of phenol -chloroform (50:50, v/v) . The aqueous phase was 
adjusted to 0.3 M NaAc and the RNA precipitated with 2 
volumes of ethanol . Following centrif ugation, the pellet 
was washed in 70% ethanol and resuspended in sterile 
distilled water. RNA samples were resolved in the 
presence of methylmercuric hydroxide in 1.3% agarose gels 
and transferred onto a nylon membrane. The membranes 
were probed wich («P) UTP labelled sense or anti-sense 
GUS RNA transcripts. These transcripts were made from 
pGEM-GUS, a plasmid obtained by inserting the BamHI-SstI 
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fragment of pBI221.1 (which contains the entire- GUS 
coding sequence) into the BamHI-SstI sites of pGEM-4Z. 
Probes were made by cutting pGEM-GUS with EcoRI and then 
using T7 RNA polymerase to provide a transcript which can 
5 hybridize to anti- sense GUS RNA or by digestion with 
Hindi I I and transcription with SP6 RNA polymerase giving 
an anti -sense transcript that can hybridize to sense GUS 
mRNA only. A radiolabelled sense-specific GUS RNA probe 
demonstrated that the levels of GUS mRNA found in TTR-1 

10 to TTR-5 were considerably lower than those observed in 
TTR-48, the original GUS+ plant. As predicted, 

untransformed tobacco did not posses the GUS transcript. 
The levels of sense mRNA correlate well with the amount 
of GUS protein and activity observed in these plants. 

15 Northern blot analysis using an anti-sense-specific GUS 
RNA probe demonstrated the presence of anti -sense GUS 
transcript in the retransf ormed plants. The reduced 
amounts of GUS protein and GUS activity observed in TTR-1 
to TTR-5 can thus be attributed to the low levels of GUS 

20 mRNA found in these plants. Low levels of GUS mRNA were 
always associated with the presence of the anti -sense GUS 
RNA. These results clearly indicate that a target sense 
gene can be successfully inhibited using anti-sense RNA. 
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Example l A 

This example describes a method for isolating 
microspores in order to obtain genes that are essential 
to pollen formation and/or function. Microspores may be 
conveniently isolated by manual dissection of buds to 
remove anthers that are subsequently disrupted by gentle 
grinding in a mortar and pestle in 10 % sucrose. The 
extract is then filtered through a 44 urn nylon mesh and 
the microspores collected by centrif ugation at 3000 x g 
for one minute. The pelleted microspores are re- 
suspended in 10 % sucrose-, filtered and pelleted as 
before, other methods of isolating microspores can also 
be used. 

Tissues other than microspores can be disrupted by 
a variety of methods and the disrupted tissue used for 
RNA isolation. It is convenient to disrupt the tissue by 
using a motor driven homogenizer with lo mis of a 
solution of 6M guanidinium HCl , 0 . 1 M Na acetate, pH 6.0, 
0.1 M beta-mercaptoethanol per gram of tissue. . The 
homogenate is cleared by centrif ugation at 5,000 x g and 
the supernatant is layered over a solution of 6M CsCl in 
Tris-EDTA buffer (TE buffer) . Centrif ugation at 100,000 
X g for 12-20 hrs at IS'C is used to pellet RNA which is 
subsequently re-suspended in water and re -precipitated in 
the presence of 0 . 3 M Na acetate and two volumes of 
ethanol. rna is recovered by centrif ugation and re- 
suspended in water. The RNA obtained by such method can 
be fractionated by oligo-d-T cellulose chromatography to 
separate the polyadenylated mRNA from the bulk of the 
non-polyadenylated RNA. The microspore RNA is isolated 
by using a tight fitting motor driven glass homogenizer 
to disrupt the microspores. The homogenization of 300 ul 
of packed microspores is conducted in l ml 6M 
guanidinium-HCl, 0.1 M Na acetate, pH 6.0, 0.1 M beta- 
mercaptoethanol. The homogenate is centrif uged at 5000 
X g and the cleared supernatant is layered over a 
solution of 6M CsCl in TE buffer. An overnight 
centrif ugation at lOO.OO x g is used to pellet the RNA 
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which is subsequently re-suspended in water and re- 
precipitated in the presence of 0.3 M Na acetate and two 
volumes of ethanol . Other methods of RNA extraction can 
be used to obtain the RNA from the tissues described. 
Standard methodology using oligo-dT cellulose is used to 
obtain polyadenylated mRNA from these total RNA 
preparations . 

Example 2 

This example relates to the use of constitutive 
promoters to regulate the expression of pollen specific 
anti- sense RNA. The Cai-rv 35S promoter from pBl 221 was 
isolated as a Hindi 1 1 - Xbal fragment and cloned in 
PALlOOl previously cut with Hindlll and Xbal. This 
produced a vector that had the CaMV 3 5S promoter joined 
to the nos £er and in between the promoter and terminator 
were unique sites for : Xbal BamHI , Smal , Kpnl and SstI . 
This vector was named PA1.1007. PAL1007 was digested with 
BamHI and to this digested vector was added a 2.4 Kb 
BamHI fragment containing a coding region from clone L16 
in the anti-sense orientation. This vector was called 
PAL13 05 and was used to transform Brassica napus . 
Transformation was carried out either using the method 
described in Moloney, M.M.. et al . , (Plant Cell Reports 
(1989) 8:238-42) or, transformation can be carried out 
with surface sterilized stem epidermal layers. For this 
procedure, seeds of B^ napus L. ssp. oleifera cv. Westar 
were sown in 'Promix' mixed with 2g/L of the slow-release 
fertilizer 'Nutricoate' in 8" pots. Plants were grown in 
the greenhouse under a 16 photoperiod (using natural and 
artificial lighting) . For coculture and regeneration 
experiments stem- sections from the top three stem 
internodes of approximately 1.5 month old plants were 
used (i.e.. those with elongated floral spi)ces and 
several open flowers) . Intact stem-sections were surface 
sterilized for 30 seconds in 70% ethanol and 10 minutes 
in 1% sodium hypochlorite followed by three rinses in 
sterile distilled water. For transformation 
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Aqrobacterium tumefaciens GV 3101 carrying the Ti plasmid 
pMP90 to provide vir functions in trans and the binary 
vector PALllio was grovm on YEP media (which consists of 
10 gm per L of Yeast Extract, 10 gm per L of Bacto- 
pepetone and 5 gm per L of NaCl , pH 7 . 0 containing 100 
ugs per mL kanamycin for selection of bacterial cells 
that contained the binary vectors) . Cells were grown 
from one to two days at 28C. The cells were collected by 
centrifugation and were resuspended at an approximate 
density of 10« - lo' cells per mL in liquid EL which 
consists of MS micro- and macro- r itrients and B5 
vitamins containing 40 mg/L of FeNa-EDTA (obtained from 
BDH chemicals) and 3% sucrose, 10 mg/L BenzylAdenine , and 
0.5 mg/L alpha naphthalene acetic acid (NAA) and 18 . 8 mM 
KNO^ plus 20.6 mM NK^NO^ . Medium was solidified with 0.8% 
agar (Sigma) when the EL media was used for solid media 
plates. 

The cell suspension was poured into the bottom of a 
sterile petri dish and sterilized stems were dissected 
directly in the bacterial suspensions. The segments were 
sectioned longitudinally into half segments and cut into 
approximately 5 mm sections. The dissected segments were 
placed on filter paper discs on solid EL media for a 3 
day coculture under continuous fluorescent light (60 
microeinsteins/m2/sec2) at 25C. After a 2-3 day 
coculture, explants were transferred to solid EL media 
containing 500 ug/mL carbenicillin, and 100 ug/mL 
bekanamycin (Sigma) . Shoocs formed in 4-8 weeks, 
sections were transferred to fresh solid EL media with 
carbinicillin and bekanamycin every 3-4 weeks. Shoots 
that formed and did not bleach were excised and rooted on 
PDR media (B5-with 2% sucrose and 0.5 mg/L each of NAA 
and lAA) . In some cases, green non-regenerating callus 
growing on selective medium was separated from explants 
and transferred to fresh medium to stimulate 
regeneration. Transformed planes were placed in a 
misting chamber, and afcer cwo to four weeks transferred 
to the greenhouse. Planes were grown under a 16 hour 
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photoperiod and allowed to flower. Clonal propagation 
was used to increase plant lines as well as hand crossing 
and selection of seedlings from crossed plants on 
kanamycin containing media. This media consisted of 0.8% 
agar, one-tenth MS salts an lOOugs per mL bekariamycin 
with no sucrose in the media. Surface sterilized seeds 
were used. The seeds were. surface sterilized by rinsing 
in 70% ethanol for a few seconds, soaking in 1% sodium 
hypochlorite for 15 minutes, followed by rinsing three 
times in sterile distilled water. Seeds were place on 
the surface of the agar in sterile dishes and allowed t :> 
sprout. Plants which did not carry the kanamycin gene 
linked to the anti-sense gene bleached and died, while 
those that carried the antisense gene stayed green and 
were subsequently transferred to soil and allowed to 
flower. 
Example 3 

In this example, another pollen specific coding 
region was used with the vector PAL1007. In this case, 
a 1.3 Kb Hindlll fragment from clone L19 was isolated, 
made blunt ended and cloned into the Smal site of pGEM 
4Z. This subclone was called pPAL1914 and was then 
digested with Xbal and SstI . This fragment was added to 
Xbal - SstI cut PAL1007, giving rise to a vector called 
PAIil307, This vector contains the CaMV 35S promoter 
fused to a coding region from clone LI 9 in the ant i- sense 
orientation. This vector was used to transform Brassica 
napus as detailed in example 34. 
Example 4 

This example relates to the use of inducible 
promoters to regulate expression of pollen specific anti- 
sense RNA. In this example, the 1.2 Kb Hindlll - PstI 
fragment of the melanogaster 70 KD heat shock protein 
promoter was isolated from the subclone pPW 22 9 
(Holmgren, R. et al . , 1979, Cell 18: 1359-1370) and 
cloned into Hindlll - PstI cut pGEM 42, The heat shock 
promoter was excised as a Hindlll - Smal fragment and 
cloned into Hindlll - Smal cut PALIOOI. This produced a 
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vector (PAL1009) that contains a heat shock promoter 
followed by a portion of the polylinker and nos ter. The 
Smal site was used to clone the 1.3 Kb Hindlll fragment 
from clone L19 following making this fragment blunt 
ended. The clone containing this fragment in the anti- 
sense orientation relative to the heat shock promoter was 
called PAL1403. This vector was used to* transform 
Brassica napus as in example 34. Additionally, since a 
single EcoRI site exists at the end of the nos ter in 
this construct, a selection marker gene was added to this 
construct, the enzyme Beta-glucuronidase driven by t^ie 
CaMV 35S promoter using this unique EcoRI site for the 
insertion of this gene. This vector, which is the same 
as PAL1403 except that it now contains a convenient gene 
for scoring transformation was named PAL1408 and was used 
for transformation of Brassica napus as described in 
example 34. 
Example 5 

This example relates to the use of a promoter 
essential for pollen development and/or function to 
specifically express pollen specific anti-sense RNA in 
pollen cells. For this, the vector PAL1107 was used for 
the production of ant i -sense RNA from the cDNA clone 4F 
described in Figure 4. To construct this anti-sense 
vector, the EcoRI fragment from the cDNA clone 4F was 
isolated, made blunt ended with Klenow and .cloned into 
the Smal site of PAL1107. A vector that contained the 
cDNA clone in the anti-sense orientation (as determined 
by restriction enzyme analysis) was chosen. This vector 
was named PAL11074F and used to transform Brassica napus 
as described in example 34. 
Example 6 

For this example, the vector PAL1107 was used for the 
production of * ant i- sense RNA from the 2.4 Kb BamHI 
fragment of clone LIB described in example 2, To 
construct this anti -sense vector, the 2.4 Kb BamHI 
fragment from clone L16 was isolated and cloned into the 
BamHI site of PAL1107. A vector that contained this 
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fragment in the anti-sense orientation {as determine by 
restriction enzyme analysis) was chosen. This vector was 
named PAL1107-16CRAS and used to transform Brassica napus 
as described in example 34. 
5 Example 7 

For this example, the vector PAL11G7 was used for the 
production of anti-sense RNA from the 1.3 Kb Hindlll 
fragment of clone LI 9 described in example 3 . To 
construct this anti-sense vector, the 1.3 Kb Hindlll 

10 fragment from clone LI 9 was isolated and made blunt ended 
and cloned into the Smal site of PAL1107 . A vector that 
contained this fragment in the anti-sense orientation (as 
determine by restriction enzyme analysis) was chosen. 
This vector was named PAL1107-19CRAS and used to 

15 transform Brassica napus as described in example 34. 
Example 8 

For this, the vector PAL1107 was used for the 
production of anti- sense RNA from the cDNA clone related 
to clone LIO described in Figure 5. To construct this 

20 anti -sense vector, the EcoRI fragment from the cDNA clone 
was isolated, made blunt ended with Klenow and cloned 
into the Smal site of PAL1107. A vector that contained 
the cDNA clone in the anti-sense orientation (as 
determined by restriction enzyme analysis) was chosen. 

25 This vector was named PAL110710G and used to transform 
Brassica napus as described in example 34 . 
Example 9 

For this, the vector PAL1107 was used for the 
production of anti -sense RNA from the cDNA clone related 

30 to clone L19 described in Figure 6. To construct this 
anti -sense vector, the EcoRI fragment from the cDNA clone 
was isolated, made blunt ended with Klenow and cloned 
into the Smal site of PAL1107. A vector that contained 
the cDNA clone in the anti-sense orientation (as 

3 5 determine by restriction enzyme analysis) was chosen. 

This vector was named PAL110719 and used to transform 
Brassica napus as described in example 34 . 
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Example 10 

In this example, the promoter essential for pollen 
development and/or function in the vector PAL1421 was 
used for the production of anti- sense RNA using the cDNA 
clone homologous to the gene contained with LIO. This 
was accomplished by excising the cDNA. clone from the 
cloning vector with EcoRI and making this fragment blunt 
ended with Klenow, This blunt ended cDNA fragment was 
cloned into the Smal site of PAL1421. Clones were 
recovered that contained the cDNA insert in both 
orientations, and one was chosen that contained the' 
insert in the ant i -sense orientation relative to the 
promoter of PAL1421, resulting in the formation of a 
binary transformation vector PAL1492. PAL1492 was used 
to transform Brassica napus as described in example 34. 
Example 11 

In this example, the promoter essential for pollen 
development and/or function in the vector PAL1121 was 
used for the production of anti-sense RNA using the cDNA 
clone homologous to the gene contained within LIO. This 
was accomplished by excising the cDNA clone from the 
cloning vector with EcoRI and making this fragment blunt 
ended with Klenow. This blunt ended cDNA fragment was 
cloned into' the Smal site of PAL1121. Clones were 
recovered that contained the cDNA insert in both 
orientations, and one was chosen that contained the 
insert in the anti- sense orientation relative to the 
promoter of PAL1121, resulting in the formation of a 
binary transformation vector PALlllO. PALlllO was used 
to transform Brassica napus as described in example 34. 
Example 12 

In this example, the promoter essential for pollen 
development and/or function from clone 19 was used for 
the expression of pollen specific anti -sense RNA. The 
transformation vector PAL1920 was digested with Smal. To 
the Smal sice was added the DNA fragment corresponding to 
the cDNA clone homologous to clone LIO by digesting the 
vector containing this clone with EcoRI and making the 
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fragment blunt ended. This blunt ended cDNA fragment was 
cloned into the Smal site of PAL1920. Clones, were 
recovered that contained the cDNA insert in both 
orientations, and one was chosen that contained the 
insert in the anti-sense orientation relative to the 
promoter of pPAL1920, resulting in the formation of a 
binary transformation vector PAL1921. PAL1921 was used 
to transform Brasslca. napus as described in example 34. 
Example 13 

In this example, anti-sense RNA is made specifically 
to the intron region of clone L19 and a ^.pecific restorer 
gene is made which lacks the intron from clone L19. The 
construction of these two vectors is outlined in 
Figure 12. PAL1954 was used to transform Brassica napus 
as in example 34 to make a male sterile line. To create 
a specific restorer plant line, the vector PAL.1955 was 
used to transform Brassica napus as in example 34. 
Example 14 

In this example, the highly active promoter fragment 
of clone HPlOl we used to synthesize anti-sense RNA to 
the intron region, of clone 19. For this example, the 
subclone containing the intron region of clone 19 that is 
detailed in Figure 12, pPAL1914 was digested with Hindlll 
and BamHI. The promoter fragment from clone pPALHPlOl 
was added to this construct as a Hindi I I - BamHI fragment 
giving rise to clones containing the promoter fragment in 
the anti-sense orientation relative to the intron. This 
clone contained the promoter in an orientation such that 
transcription of the promoter would cause the production 
of an RNA a portion of which would contain anti- sense RNA 
homologous to the intron region of clone L19. This clone 
was called pPAL19HP. The clone pPAL19HP was digested 
with Hindlll and SstI, and cloned into the vector PALlOOl 
using the Hindlll and SstI sites of PALlOOl, creating 
PALHP19, PALHP19 was used to transform Brassica napus as 
in example 34. It should be noted that the vector 
PAL1955 (see example 13 and Figure 12) can be used for 
fertility restoration in plants chat carry PALHP19, 
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Example 15 

The promoter essential for pollen development and/or 
function contained in the subclone pPAL0420 was used for 
the construction of an anti -sense RNA gene under the 
control of a promoter essential for pollen development 
and/or function as follows: A DNA fragment coding for a 
polyubiquitin of Arabadopslff was isolated from a plasmid 
which contains a polyubiquitin gene that has 5 copies of 
the ubiquitin monomeric protein was obtained from the 
University of Wisconsin, Madison, WI, USA and is 
descr-'bed in Burke e t ai . , Molecular and" General 
Genetics, in press. A BamHI - Bglll fragment was 
isolated that contains 3 of the 5 copies of the 
polyubiquitin gene and this fragment was inserted into 
PPAL0420 by using the single BamHI site of the polylinker 
of PAL0420. This gave rise to a plasmid containing the 
promoter essential for pollen development and/or function 
from clone L4 followed by an DNA fragment containing 
ubiquitin coding sequences in the anti-sense orientation 
followed by the sos ter polyadenylation signal. This 
promoter/anti -sense gene construct was excised from 
PPAL0420 by digestion with EcoRI . The EcoRI fragment 
that contains the promoter anti -sense gene was inserted 
into the EcoRI site of the polylinker of Binl9, resulting 
in the formation of a binary transformation vector was 
named PAL1479. PAL1479 was used to transform Brassica 
napus as described in example 34 . 
Example 16 

The vector PAL1479 was used to transform tobacco as 
described in example 34 . 
Examplf> 1 7 

The promoter essential for pollen development and/or 
function contained in the subclone pPAL04 20 was used for 
the construction of a pollen specific anti -sense RNA gene 
as follows: A DNA fragment coding for pine actin was 
obtained from J. Kenny-Byrne, Pecawawa Canada. Two 
clones were obtained, Pac l-A and Pac 2. the clone Pac 1- 
A being described m: Canadian Journal of Forestry 
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Research (1988) 18:1592-1602, and the second clone Pac 2 
(the sequence being closely homologous to that in Pac 1-A 
and the nucleotide sequence of which having been 
submitted for publication) . A SphI fragment was isolated 
5 from Pac 2 that contains the complete coding sequence of 
pine actin. This fragment also contains a small amount 
of 5' and 3' non-coding region. This SphI fragment was 
cloned into the unique SphI site of pGEM 4Z. From this 
subclone, a Xbal fragment was isolated that contains only 

10 coding region and this Xbal fragment was cloned into the 
unique EcoRI site of pGEM 4Z. A clone was' chosen that 
had the orientation such that the 5' end of the gene was 
next to the BamHI site in the polylinker and the 3' end 
of the gene was next to the Sail site in the polylinker. 

15 This plasmid was called pPAL PAC. The actin coding 
region was isolated form pPAL PAC by digestion with BamHI 
and Sail. This BamHI - Sail fragment was cloned into 
pPAL0420 using the BamHI and Sail sites contained within 
the polylinker. of pPAIi0420. This gave rise to a plasmid 

20 containing the promoter essential for pollen development 
and/or function from clone L4 followed by an DNA fragment 
containing the actin coding sequence in the anti- sense 
orientation followed by the nos ter polyadenylation 
signal . This promoter/anti-sense gene construct was 

25 excised from pPAL0420 by digestion with EcoRI, The EcoRI 
fragment that contains the promoter anti -sense gene was 
inserted into the EcoRI site of the polylinker of Binl9, 
resulting in the formation of a binary transformation 
vector PAL1498. PAL1498 was used to transform Brassica 

30 napus as described in example 34. 
Example IS 

The vector PAL1498 was used to transform tobacco 
plants as outlined in example 34. 
Example 19 

In this example, tobacco that has been previously 
transformed to hygromycin resistance was transformed with 
an anti-sense gene that specifically blocks the 
hygromycin resistance in the pollen cells by virtue of 
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the fact that the anti-sense gene is under the control of 
a promoter essential for pollen development and/or 
function in the vector PAL1106 . Transformed tobacco that 
were resistant to hygromycin were obtained using the 
vector PAL1302 which is described in example 1. 
Selection of tobacco plant cells resistant to hygromycin 
was via coculture with PAL13 02 and selection of 50 ug per 
ml hygromycin. Southern blot analysis demonstrated the 
presence of 5-6 copies of the sense hygromycin 
phosphotransferase gene in one plant. This plant, 
referred to as TTR-12.- , was retransformed with a vector 
called PAL1107A. Palll07A is the vector PAL1106, to which 
has been added the 0.8 Kb BamHI hygromycin 
phosphotransferase fragment isolated from PAL13 02 and 
inserted into PAL1106 in the anti-sense orientation 
relative to the promoter essential for pollen development 
and/or function in PAL1106. The resulting vector was 
called PAL1107A. Plants obtained from this 

transformation were resistant to both hygromycin and 
kanamycin in the leaf tissue and were shown by southern 
blot analysis to contain the anti- sense gene. These 
plants were allowed to grow in the greenhouse and self 
fertilize. Clonal propagation of these plants were used 
as a preliminary increase of single plants, and these 
clonally propagated plants were used for the production 
of male sterile plant line. For example, a plant which 
contained a single copy of the anti-sense gene and was 
derived from the transformation of TTR-122 was planted in 
a sand soil mixture and allowed to grow in the 
greenhouse. This plant is referred to as TTR-203. 
Measurement of the hygromycin phosphotransferase activity 
in this plant demonstrated high accivity in leaves, 
petals, stigma and pistal and anther walls, but very low 
levels in pollen. TTR-122 showed high levels of 
hygromycin phosphotransferase activity in leaves, petals, 
stigma and pistal and anther walls and in pollen. This 
demonstrated that the anti -sense gene was effective in 
blocking che expression of the sense gene only in the 
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pollen. Northern blot analysis confirmed the presence of 
the anti-sense transcript specifically in the pollen of 
TTR-203 and also demonstrated low levels of the sense 
gene mRNA. When flower buds first appeared, TTR-203 was 
5 watered three times weekly with a solution of hygromycin 
(250 ugs per mL) , thoroughly saturating the sand soil 
mixture. This watering was continued for approximately 
four weeks, or the period of time in which the major 
flowering was occurring. Flowers produced during this 

10 time on plants containing the anti- sense gene were male 
sterile. Anther and pc'.len formation was inhibited, and 
mature pollen failed to develop. Female fertility was 
unaffected by this treatment as hand pollination could be 
used for the pollination of the female portion of the 

15 male sterile flowers. Pollen that was from a hygromycin 
resistant plant was used for the hybrid seed production. 
Watering of the plants with hygromycin was stopped, and 
normal watering was resumed. Flowers that fozTned on' 
plants after the hygromycin watering was stopped were 

20 male fertile and set selfed seed. 
Example 2 0 

The pollen specific vector PAIj1107 was used for the 
production of male sterile plants. The plant TTR-122 
described above, was retransf ormed with a vector called 

25 PAL1107HYGAS . Transformation was conducted as described 
in example 34. PAL1107HYGAS is the vector PAL1107 to 
which has been added the 0.8 Kb BamHI hygromycin 
phosphotransferase fragment isolated from PAL13 02 and 
inserted into PAL1107 in the anti -sense orientation 

3 0 relative to the promoter essential for pollen development 
and/or function in PAL1107. Plants obtained from this 
transformation were resistant to both hygromycin and 
kanamycin in the leaf tissue and were shown by southern 
blot analysis to contain the anti -sense gene. These 

35 plants were allowed to grow in the greenhouse and self 
fertilize. Clonal propagation of these plants were used 
as a preliminary increase of single plants, and these 
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clonally propagated plants were used for the production 
of male sterile plant lines as in example 19. 
Example 21 

For the production of male sterile plants, the vector 
PAL1419 was used to transform tobacco as outlined in 
example 34. The vector PAL1419 contains the promoter 
essential for pollen development and/or function from 
clone L4 controlling the expression of the NPT II gene 
oriented in the anti -sense orientation relative to the 
promoter essential for pollen development and/or 
function. This vector also cohtai-vs a constitutrve 
version of the NPT II gene in the sense orientation 
driven by the nos promoter. This vector therefore can 
confer resistance to kanamycin in all plant cells except 
pollen cells wherein the expression of the sense gene is 
inhibited by the expression of the anti -sense gene which 
is specifically expressed in the pollen. Tobacco plants 
were obtained following transformation of the vector 
PAL1419. These plants were rooted and allowed to set 
flower. Plants were watered with kanamycin while 
flowering. 
ExamTjle 2 2 

The vector PAL1419 was used to transform Brassica 
napus as outlined in example 34. 
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Example 23 

The vector PAL1419 was used for the transformation 
of petunia leaf discs. 

Example 24 

5 In this example, the complete ricin A chain (pPAL 

Riccom) described in Figure 13 was inserted into the 
vector PAL1420 as a BamHI fragment. Vectors were 
recovered that contained the ricin gene in both the sense 
and anti- sense orientation. A vector that contained the 

10 ricin A chain in the sense orientation was recovered and 
called PAL1420RIC. A vector that contained the gene in 
the anti-sense orientation was called PALil420RICAS . 
PAL1420RIC and PAL1420RICAS were used for transformation 
of Brassica napus as described in example 34 giving rise 

15 to plants that carried either the sense or anti-sense 
copy of the ricin A chain gene under the control of the 
promoter essential for pollen development and/or function 
in PAL1420. 
Example 25 

2 0 In this example, the truncated version of the ricin 

A chain (pPAL Rictr) described in Figure 13 was inserted 
into the vector PAL1420 as a BamHI fragment. Vectors 
were recovered that contained the truncated ricin gene in 
both the sense and anti-sense orientation. A vector that 
25 contained the ricin A chain in the sense orientation was 
recovered and called PAL1420tRIC. A vector that 
contained the gene in the anti -sense orientation was 
called PAL1420tRICAS. PAL1420tRIC and PAL1420tRICAS were 
used for transformation of Brassica napus as in example 

3 0 34 giving rise to plants that carried either the sense or 

anti-sense copy of the truncated ricin A chain gene under 
the control of the promoter essential for pollen 
development and/or function in PAL1420. 
Example 2 6 

35 PAL1420RIC and PAL1420RICAS were used for 

transformation of tobacco plants as in example 34, giving 
rise to plants chac carried either the sense or anti- 
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sense copy of the ricin A chain gene under the control of 
the promoter essential for pollen development and/or 
function in PAL1420. 
Example 27 

The vectors PAL1420tRIC and PAL1420tRICAS were used 
for transformation of tobacco plants as in example 34 
giving rise to plants that carried either the sense or 
anti- sense copy of the truncated ricin A chain gene under 
the control of the promoter essential for pollen 
development and/or function in PAL1420. 
Example 28 

For this example, the vector PAL1423 was used to 
express the polylysine gene contained in the subclone 
pPAL pLys . The coding region from pPAL pLys was isolated 
by BamHI digestion and cloned in the sense orientation 
into BamHI cut PAL1423, giving rise to PAL14 87. PAL1487 
was used to transform tobacco as in example 34. 
Example 2 9 

For this example, the vector PAL1920 was used to 
express the polylysine gene contained in the subclone 
pPAL pLys . The coding region from pPAL pLys was isolated 
by BamHI digestion and cloned in the sense orientation 
into BamHI cut PAIil920, giving rise to PAL1987. PAL1987 
was used to transform tobacco as in example 34 . 
Example 3 0 

In this example, a promoter essential for pollen 
development and/or function was used to synthesize a 
protein molecule that is destructive to cellular function 
and development, namely the protease trypsin. The cDNA 
sequence coding for trypsin has been described by 
Stevenson et al . , Nucl Acids Res, 1986, 14:8307-30. The 
CDNA clone pMPt9 was obtained and used for the production 
of a modified trypsin molecule in which the N- terminal 
amino acid residues were removed to give a protein that 
consisted of solely the active protease form of trypsin 
and differed from the mature form in that there was a 
methionine residue ac the N-terminal position of the 
mature protein, replacing the Isoleuecine found at this 
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position in the active mature protein. This was 
accomplished by digesting the plasmid pMPt9 with Fokl and 
PstI, and recovering a fragment that encompasses 
nucleotides 81 to 835, the nucleotides after 835 being 
the G:C tail used for cloning of the cDNA, and treating 
this fragment with Klenow, and cloning into blunt ended 
Smal cut M13mpl9RF, and isolating a single stranded phage 
clone that was used for site specific metagenesis to 
change the isoleucine codon at nucleotides 84-86 in the 
published sequence from ATT to ATG, introducing a 
initiation cordon where the isoleucine codon was. The 
mutated gene recovered was excised with Sail and SstI, 
and inserted into PAL1421 and named PAL1456. The vector 
PAL1456 was used to transform tobacco as in example 34. 
It should be noted that a restorer gene can be made if a 
trypsin inhibitor is inserted in an analogous fashion 
using a clone L4 derived promoter and transformed into a 
male parent line. The expression of the trypsin 
inhibitor in the hybrid will specifically block the 
activity of the trypsin enzyme. A number of cDNA and 
genomic DNA sequences can be found for soybean and other 
trypsin inhibitors, for example see: Jofuku, K.D. and 
Goldberg, R.B,, The Plant Cell 1989, 1:1079-1093. 
Example 31 

In this example, we use a promoter essential for 
pollen development and/or function to synthesize the 
enzyme lamH specifically in pollen cells. The enzyme has 
activity that can cause the production of NAA from NAM, 
the substance NAA functioning as a plant hormone that is 
sxibstantially toxic to developing pollen grains, while 
the precursor NAM being relacively non-toxic. For this 
example, the lamH gene was inserted into the vector 
PAL1423, The lamH gene was isolated from pPCV311 as 
described in Figure 19 and cloned as a BamHI - SstI 
fragment in the BamHI - SstI sites of PAL1423, creating 
PAL1424. This vector has the lamH gene (T-DNA gene 2) 
under the control of a promoter essential for pollen 
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development and/or function from clone L4 , PAL1424 was 
used to transform tobacco as outlined in example 34. 
Example 3 2 

The vector PAL1424 was used to transform Brassica 
5 napus as outlined in example 34. 
Example 3 3 

In this example, the vector PAL1107 was used for the 
production of tissue-specific GUS (beta-glucuronidase) 
enzyme . The gene for this enzyme is available from 

10 Clonetech Laboratories, Palo Alto, CA. , USA. The gene 
was inserted into PAL1107 as a BamHI - SstI ' fragment and - 
was used to transform tobacco as in example 34, Plants 
produced had detectable GUS activity only in developing 
pollen cells, and not any other tissue tested. It should 

15 be noted that application of non- toxic analog of 
glucuronic acid to which has been conjugated a toxic 
molecule such as glyphosate could be applied to these 
plants and cleavage of the toxic moiety from the 
glucuronic acid would occur only in pollen cells. This 

20 provides an example of an enzyme that could be used for 
the production, in. a tissue-specific fashion, of a toxic 
substance from a non- toxic analog. 
Example 3 4 

This example describes methods used to transform 

25 tabacco and Brassica napus. 

For tobacco transformation, the tobacco cultivar, N. 
tabaccum, cv. Delgold was used. To accomplish this 
transformation, tobacco leaves less than 8 inches in 
length were surface sterilized by exposure to ethanol for 

30 5-6 seconds, then subsequent exposure to 1% sodium 
hypochlorite for a few minutes, usually 5-10 minutes, or 
until the cut edge of the petiole turned white, then were 
rinsed several times in sterile distilled water. Leaf 
segments of approximately 0.5 to 1.0 square centimeters 

3 5 were excised from the sterile leaves, and were cocultured 
on shoot inducing media for two days with Agrobacterium 
tumefaciens GV 3101 carrying the Ti plasmid pMP90 to 
provide vir functions xn crans (described by Koncz, C. 
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and Schell, j,, 1986, Mol . Gen. Genet. 204:383-396) 
carrying the binary vector of interest , The vector is 
usually a derivative of Binl9 which contains the NPT II 
gene driven by the nopal ine synthase promoter and 
5 terminated by the nos ter for selection of plant cells 
with kanamycin. Binl9 is available from Clonetech 
Laboratories, Palo Alto, CA. , U.S.A. Transformed tobacco 
cells are selected on a shoot -inducing medium containing 
0.8% agar, MS salts, B5 vitamins, 3% sucrose, 1 mg per L 

10 of benzyladenine, 0.1 mg per L of alpha naphthalene 
acetic acid, (NAA) 300 /xg/ml kanamycin and 500 /xg/ml 
carbenicillin (essentially as described by Horsch et al . , 
1985, Science, 227:1229-31) . Regenerated shoots are then 
transferred to a root- inducing medium consisting of B5 

15 medium with 2% sucrose, 500 ;ig/ml carbenicillin and 0.5 
mg/L each of NAA and indoleacetic acid (lAA) . Rooted 
transf ormants are transferred to a misting chamber 
containing high humidity, after which the humidity is 
gradually lowered and plants are subsequently transferred 

2 0 to the greenhouse , 

For transformation of Brassica napus, the binary 
vector containing Agrobacterium strain GV 3101 carrying 
pMP90 to provide vir functions in trans is used. 
Transformation was carried out either using the method 
25 described in Moloney, M.M., et al . , (Plant Cell Reports 
(1989) 8:238-42) or, transf oormat ion can be carried out 
with surface sterilized stem epidermal layers. For this 
procedure, seeds of B. napus L. ssp. oleifera cv. Westar 
were sown in 'Promix' mixed with 2g/l of the slow-release 

3 0 fertilizer 'Nutricoate' in 8" pots. Plants were grown in 

the greenhouse under a 16 photoperiod (using natural and 
artificial lighting) , For coculture and regeneration 
experiments stem-sections from the top three stem 
internodes of approximately 1 . 5 month old plants were 
3 5 used (i.e., those with elongated floral spikes and 
several open flowers) • Intact stem-sections were surface 
sterilized for 30 seconds in 70% ethanol and 10 minutes 
in 1% sodium hypochlorite followed by three rinses in 
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Sterile distilled water. For transformation 

Agrobacterium tumefaciens GV 3101 carrying the Ti plasmid 
PMP90 to provide vir functions in trans and the binary 
vector of choice was grown on YEP media (which consists 
of 10 gm per L of Yeast Extract, 10 gm per L of Bacto- 
pepetone and 5 gm per L of NaCl , pH 7.0 containing 100 
ugs per mL. kanamycin for selection of bacterial cells 
that contained the binary vectors) . Cells were grown 
from, one to two days at 28C. The cells were collected by 
centrifugation and were resuspended at an approximate 
density of 10<* - lo' cells per mL in liquid EL which 
consists of MS micro- and macro- nutrients and B5 
vitamins containing 40 mg/L of FeNa-EDTA (obtained from 
BDH chemicals) and 3% sucrose, 10 mg/L BenzylAdenine , and 
0.5 mg/L alpha naphthalene acetic acid (NAA) and 18.8 mM 
KNO^ plus 20.6 mM NH4N0^ Medium was solidified with 0.8% 
agar (Sigma) when the EL media was used for solid media 
plates . 

The cell suspension was poured into the bottom of a 
sterile petri dish and sterilized stems were dissected 
directly in the bacterial suspensions. The segments were 
sectioned longitudinally into half segments and cut into 
approximately 5 mm sections. The dissected segments were 
placed on filter paper disc on solid EL media for a 3 day 
coculture under continuous fluorescent light (.60 
microeinsteins/m2/sec2) at 25C. After a 2-3 day 
coculture, explants were transferred to solid EL media 
containing 500 ug/mL carbenicillin, and 100 ug/mL 
bekanamycin (Sigma) . Shoots formed in 4-8 weeks, 
sections were transferred to fresh solid EL media with 
carbinicillin and bekanamycin every 3-4 weeks. Shoots 
that formed and did not bleach were excised and rooted on 
PDR media (B5- with 2% sucrose and 0.5 mg/L each of NAA 
and lAA) . In some cases, green non- regenerating callus 
growing on selective medium was separated from explants 
and transferred to fresh medium to stimulate 
regeneration. Transformed plants were placed in misting 
chamber, and after two - four weeks transferred to the 
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greenhouse. Plants were grown under a 16 hour 

photoperiod and allowed to flower. 

Clonal propagation was used to increase plant lines 
as well as hand crossing and selection of seedlings from 
5 crossed plants on kanamycin containing media. This media 
consisted of 0.8% agar, one-tenth MS salts and 100 ugs 
per mL bekanamycin (available from Sigma Chemicals, St, 
Louis, MO., U.S.A.) with no sucrose in the media. 
Surface sterilized seeds were used. The seeds were 

10 surface sterilized by rinsing in 70% ethanol for a few 
seconds, soaking in 1% sodium hypochlorate for 15 
minutes, followed by rinsing three times in sterile 
distilled water. Seeds were placed on the surface of the 
agar in sterile dishes and allowed to sprout. Plants 

15 which did not carry the kanamycin gene linked to the 
antisense gene bleached and died, while those that 
carried the antisense gene stayed green and were 
subsequently transferred to soil and allowed to flower. 
Example 3 5 

20 This example describes the isolation of two genes 

involved in tumor formation in plant tissues following 
infection with Agrobacterium, the lamS and the lamH genes 
from the Ti plasmid of the Agrobacterium tumefaciens 
strain CSS. The isolation of the lamH gene is 

25 particularly described. The source of DNA coding for 
these genes was the plasmid pPCV 311. The plasmid 
pPCV311 is described in: Koncz, C, and Schell, J., 
Molecular and General Genetics, (1986), 204:383-396, and 
contains the oncogenic region of the T-DNA plasmid 

30 contained in the C58 strain of Agrobacterium, The 
plasmid pPCV 311, contains a region of T-DNA that when 
transferred to plant cells causes tumor formation. This 
oncogenic region of the T-DNA is entirely contained in 
the plasmid pPCV-311. This region of DNA contains four 

35 genes, that when expressed in plant cells are sufficient 
for tumor formation. The approximate coding regions of 
these four genes and the direction of transcription of 
these four genes are indicated m Figure 15. The other 
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portions of the vector pPCV 311 are not shown in that 
they are not relative to the following constructions. 
Additionally, the oncogenic region of the Agrobacterium 
strain C58 is located on the T-DNA plasmid within that 
5 bacterium, commonly referred to as the wild- type nopaline 
plasmid. A nearly identical oncogenic region is also 
found in wild type octopine strains which could also be 
used as a source of genes. The complete nucleotide 
sequence of an octopine strain oncogenic region is 

10 described by Barker et al , , Plant Molecular Biology 
2:335-50- (1983) . The^p^rtial sequence obtained from 
various constructs of genes derived from pPCV 311 was 
compared to the published nucleotide sequence. 

Two genes were isolated from pPCV 311, the lamH and 

15 the lamS genes, commonly referred to as genes 2 and 1 
respectively. The lamH gene was isolated by first 
subcloning the indicated Hindlll fragment, a fragment 
that contains all of the coding region of gene 2 and 
additional 5' sequences that were subsequently removed 

20 for the construction of a promoterless version of the 
gene. The restriction sites mapped in this subclone are 
shown in Figure 15 and the subclone is referred to as 
pPAL G2 . For the isolation of coding secpiences only, 
pPAL G2 was first split into two smaller clones and the 

25 gene later reconstructed. The Xbal - Smal and Smal - 
Smal fragments shown in Figure 15 were isolated by gel 
elution and subsequently cloned into the following 
vectors : The Smal - Smal fragment was cloned into pGEM 
42, giving rise to pPAL899 . The Xbal - Smal fragment was 

30 subcloned into pGEM 7Z, giving rise to pPAL898. The 5' 
non- coding sequences of the lamH gene that are present in 
this subclone were removed in the following fashion: 
pPAL898 was digested with EcoRI , the EcoRI site is in the 
promoter region of the clone, and in this subclone is the 

35 only EcoRI site. This digested DNA was then treated with 
Exonuclease III, and following digestion created with SI 
nuclease and the Klenow fragment of DNA polymerase I . 
The treated DNA was then cut with PstI and treated with 
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Klenow fragment in order to make the PstI site blunt . 
The linear, digested, blunt ended plasmid was then 
relegated and used to transform E. coli DH5 -alpha 
according to standard protocols. Siibclones were chosen, 
5 sequenced and one subclone was chosen that was deleted to 
8 nucleotides in front of the ATG start of translation 
codon. The ATG start codon was determined by comparison 
of the nucleotide sequence obtained from the deleted 
subclones to the nucleotide sequence for the octopine 

10 strain described by Barker, et al . , Plant Molecular 
Biology 2:335-50 (19B?-) . The nucleotide sequences of 
both the 5' non-coding and the coding region were nearly 
identical. This subclone was named pPAL897, the ATG 
codon is shown in Figure 15, the direction of 

15 transcription in this case would be from right to left in 
Figure 15. The plasmid contained the 5' half of the 
coding region from the lamH gene, with the promoter 
sequences deleted . 

The construction of the 3' half of the lamH gene, 

20 contained in the plasmid pPAIiSSB was carried out as 
follows. A 3' region of the gene that contains the 
polyadenylation signal naturally found in the gene was 
isolated by digestion pPAL8 98 with the enzymes BamHI and 
Apal. The digested DNA was treated with Klenow fragment 

25 to make it blunt ended and was relegated. This gave rise 
to the subclone pPAL896, which is a plasmid that contains 
the 3' half of the lamH gene. To reconstruct the intact 
lamH gene, pPAL8 9 6 was digested with Hindi! I and Smal, 
and the 3' half gene fragment was isolated by gel 

30 elution. pPAL897 was digested with Smal and Hindlll and 
the isolated 3' fragment from pPAL8 96 was cloned into 
these sites, reconstructing a promoterless version of the 
gene that contains the indicated array of restriction 
sites flanking the gene. This plasmid was named pPAL895 

35 and is shown in Figure 15. 
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Examples 3 6 

This example describes the isolation and construction 
of a promoterless version of the gene 1, lamS: indole 
acetamide synthase gene of the Ti plasmid of the 
Agrobacterium tumefaciens . strain 058 which procedure is 
summarized in Figure 16 . The gene was isolated from the 
plasmid pPCV311. The Smal - PstI fragment that contains 
5' and 3' regions of the lamS gene as well as the coding 
region was isolated by gel elution and subcloned into a 
derivative of pGEM 42 called pGEM-noEco. pGEM-noEco is 
a plasmid from which the EcoRI site of pGEM 4Z has been 
removed by cutting with EcoRI and making blunt ended and 
relegating such that only the EcoRI site was removed. 
The fragment was inserted in the orientation shown 
relative to the polylinker. This subclone was called 
PPAL889. PPAL889 was digested with EcoRI, and briefly 
treated with Exonuclease III, followed by SI nuclease. 
The DNA was digested with Smal and treated with Klenow 
fragment to make it blunt ended. The DNA was relegated 
and clones recovered. Some of these clones were chosen, 
sequenced, and one clone was found which had 5' sequences 
deleted such that only approximate 15 bases upstream of 
the ATG start of translation codon remained. This 
plasmid was named pPAL888. The Kpnl site at the 5' end 
of the gene as well as the PstI site at the 3' end of the 
gene were both converted to Sail sites by cutting with 
Kpnl, end filling with Klenow and adding synthetic Sail 
linkers, and repeating the linker addition. at the PstI 
site such that the entire gene can be excised as a single 
Sail fragment. This plasmid was named pPAL887. This 
plasmid contains the promoterless version of the lamS 
gene and contains the array of restriction sites shown 
that flank the gene as shown in Figure 16 . 
Example 21 

This example relates to the detailed characterization 
of the coding regions of clone number L4 a microspore 
specific clone isolated from a Brassica napus genomic 
library and the construction of vectors containing these 
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genes. Figure 2a is a schematic representation of the 
restriction map and coding regions of clone number L4 . 
The clone contains three different members of the same 
gene family. These genes are identified as Bp4A, Bp4B 
5 and Bp4C. The first (Bp4A) and third (Bp4C) genes are 
functional, the second gene has modifications that most 
likely render it non- functional . The restriction map is 
made diagramatically in that the non- transcribed regions 
are shown as a single line, while the transcribed regions 

10 are shown as a boxed area. The second gene (Bp4B) is 
identified on the basis of sequence homology and is 
therefore boxed with a dotted line. The notation "del 
220" refers to an approximately 220 base pair 
deletion/rearrangement affecting the second gene {Bp4B) 

15 in this clone. Start of transcription is located at the 
leftmost side of each boxed area (except in the case of 
gene Bp4B) and exon and intron positions are noted by the 
exons being filled in with black and the intron positions 
being left unfilled. A small arrowhead is shown on the 

2 0 non- transcribed 5' region of each gene, this arrowhead 
serves to indicate the promoter region of each gene. 
Restriction sites are identified such that the number of 
the first nucleotide of the restriction enzyme 
recognition site is shown. Not all restriction sites are 

25 shown, only those relative to the constructs detailed 
within are indicated. The genes are presented with the 
5' region being on the left side and the 3' region being 
on the right. The numbering of the DNA sequence in all 
cases starts from left to right, 5' to 3 ' . 

30 In Figure 3a, the DNA sequence of the clone L4 is 

shown, the orientation of the sequence and the genes 
contained within the microspore specific clone from 
Brassica napus are from 5' to 3 ' . In Figure 3a, clone 
L4, nucleotide 1 in the complete sequence is at the left- 

35 most EcoRI site while nucleotide 8579 is at the first 
nucleotide of the right -most EcoRI site. The start of 
transcription of gene 1 in clone L4 is nucleotide 235. 
The 5' and 3' intron splice sites are identified in 
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boldface type. The start ATG codon is shown as well as 
the stop termination codon. The deduced amino, acid 
sequence of the proteins encoded for by these genes are 
also shown. The end of transcription for gene 1 is 
approximately nucleotide 1427. As indicated above the 
second gene in clone L4 is most likely non- functional due 
to an insertion and a deletion that occurs in the region 
of the promoter and first exon. This gene was not 
utilized for constructs. The third gene in clone L4 has 
a transcriptional start at position number S2 98 in the 
DNA sequence and transcription ends at approximate.', y 
nucleotide 7490. The ATG start codon, intron splice 
sites and termination stop codon are all identified as 
above. Vectors were constructed from this clone by using 
promoter fragments from both genes 1 and 2 . The specific 
promoter fragment constructs are detailed below. 

The construction of 2 vectors containing promoter and 
promoter fragments from the clone L4 was carried out as 
follows and as shown in Figure 7 (A,B,C,D) . The first 
vector was constructed by first subcloning the EcoRi-SstI 
(nucl. 1-2132) fragment containing the first gene of 
clone L4 (23 5 base pairs of promoter/exon/intron/ second 
exon) in the commercially available vector pGEM-4Z 
(Promega Biotech, Madison, WI, U.S.A.) using the EcoRI- 
Sstl sites of the polylinker of this vector. This 
plasmid was named pPAL0402. The 2 . 7 Kb EcoRI fragment of 
clone L4 that contains the third gene (Bp4C) was then 
cloned into the EcoRI site of pGEM 4Z, leading to a 
plasmid called pPAL0411. The plasmid pPAL0402 was then 
digested with EcoRI and the 2 . 7 Kb EcoRI fragment from 
PPAL0411 (nucl. 5859-8579) that contains the gene number 
three (Bp4C) from clone L4 was added to it. Clones were 
recovered that contained this inserted 2.7 Kb EcoRI 
fragment in both orientations relative to the promoter 
region of the first gene. A clone that contained this 
third gene fragment in an orientation such that the 
promoter from the third gene was opposite to the promoter 
in Che first gene was chosen and called pPAL0403. 
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pPAL0403 contains the entire third gene from clone L4 
oriented in such a fashion as to have the promoter region 
immediately adjacent to the 23 5 base pair promoter region 
of the first gene in pPAL04 03 . This plasmid, pPAL0403 
5 was digested with Ddel. Ddel digestion produced a 
fragment of approximately 1.9 Kb. The Ddel sites are 
located at nucleotides 303 and 7366, Because of the 
orientation of these fragments, digestion with Ddel 
produces a 1.9 Kb fragment. This 1.9 Kb fragment 

10 contains a copy of the third gene (Bp4C) oriented such 
that thti direction of transcription of this third gene is 
from right to left, fused to the 235 base pair promoter 
fragment from the first gene of clone L4 (Bp4A) which is 
transcribed from left to right, ending in a Ddel site 

15 that is located 67 basepairs down stream of the major 
start site of transcription and proceeds the ATG start of 
translation codon by 2 nucleotides. This 1.9 Kb Ddel 
fragment made blunt with Klenow fragment and cloned into 
the Xbal site of the polylinker region of pGEM 4Z 

20 previously made blunt ended with Klenow fragment. The 
resultant plasmid pPAL0408, was recovered and 
subsequently was digested with Sail and SstI, which 
releases the cloned Ddel fragment bordered by on the left 
hand side, (nucl 7366) Sail and on the right hand side 

25 (nucl 303) of this construct and contains a portion of 
the polylinker of pGEM 42 containing the following unique 
sites: BamHI , Smal, Kpnl, and SstI restruction enzyme 
sites. This Sail - SstI fragment was cloned into the 
Sail - SstI sotes of PALlOOl, PALlOOl is the binary 

30 vector Binl9 (described by Bevan, M. , Nucleic Acids Res., 
1984, 12:8711-8721) to which has been added the nor ter 
polyadenylation signal as a 260 bp SstI - EcoRI fragment 
isolated from the plasmid pRAJ221 (available from 
Clonetech Laboratories, Palo Alto, CA, U.S.A.) in the 

35 SstI - EcoRI sites of the polylinker region of Binl9 . 

The nos ter is identified as a stippled box. The binary 
transformation vector that resulted from the insertion of 
the Sail - SstI fragment of pPAL04 08 into PALlOOl was 
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named PAL1107. The details of the construction are shown 
in Figure 7A. This vector has a copy of the third gene 
oriented such that the direction of transcription of this 
third gene is"from right to left, fused to the 23 5 base 
pair promoter fragment from the first gene of clone L4 
which is transcribed from left to right, followed by a 
polylinker with unique sites for the insertion of DNA 
which consist of: BamHI , Smal, Kpnl and SstI followed by 
the nos ter. This vector has the feature in that 
additional 5' non-coding sequences were placed upstream 
to- the 235 -base pair core promoter on Bp4A, but these* 
additional 5' sequences- were in an opposite orientation. 
The provision of these sequences in this orientation does 
not affect the pollen specificity of the core 235 base 
pair promoter. With this vector, one can also ascertain 
for position effects of the transformation process in 
that since the vector contains an intact copy of the Bp4C 
gene, probing for the levels of expression of the Bp4C 
gene can give an indication as to what levels of 
expression from the other promoters essential for pollen 
development and/or .function in PAL1107 can be expected. 
Example 3 8 

This example describes the construction of additional 
vectors having the promoter regions of the genes 
contained in clone L4 whcih may be useful for pollen 
specific expression of gene sequences. The three genes 
within clone L4 {Bp4A, Bp4B, Bp4C) show very near-exact 
DNA homology and this is most apparent between the first 
(Bp4A) and third (Bp4C) gene. The second gene (Bp4B) is 
a homologous copy that has undergone sequence changes 
that have appear to have lead to inactivation of it. The 
extensive similarity between the first, second and third 
genes in clone L4 is also maintained in the promoter 
region such that out of the first 235 nucleotides of the 
first and third gene promoter regions there are only 5 
nucleotides that differ between them. Downstream of the 
TATA box m these two promoters the only difference 
between them is the presence of one additional nucleotide 
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at the start of transcription (e.g.. Promoter l,Bp4A: 

TATGTTTt A AAA . . . versus promoter 3 , Bp4c : 

TATGTTTAAAA (transcribed region underlined, 

single nucleotide difference shown in lower case) . 
5 However, within the sequence of the first gene there is 
a nucleotide change that introduces a Ddel site (nucl 
3 03) in the untranslated 5' leader sequence upstream of 
the ATG start codon that is not present in the 
untranscribed leader secjuence of the third gene in clone 

10 L4 . For this reason, chimeric promoter constructs were 
made which utilized this Ddel site, la the first gene 
joined to sequences from the third geiie promoter. The 
region of the first promoter used for these constructs 
consisted of the sequences contained between the SnaBI 

15 site (nucl 210) near .the TATA box to the Ddel site 
located immediately upstream of the ATG start codon in 
the first gene (nucleotide 303 is the first nucleotide in 
the recognition sequence for Ddel) . The other region of 
this chimeric promoter (5' of the TATA box) was a 

20 fragment extending from the EcoRI site of the third 
promoter (nucleotide 5858) to the SnaBI site near the 
TATA box (nucleotide 6273) . Therefore to facilitate 
construction of these pollen specific vectors, the 
following reconstructions were performed. 

25 The EcoRI to Ddel fragment that encompasses the 

promoter region of the first gene in clone L4 was 
isolated by first cutting pPAL0402 with Ddel, blunting 
with Klenow, and then cutting with EcoRI. The 235 base 
pair fragment corresponding to this region was cloned 

30 into the EcoRI - Smal sites of pGEM 4Z, This plasmid 
(pPAL0422) , was then cut with EcoRI and SnaBI. A DNA 
fragment that contained the EcoRI to SnaBI portion of the 
promoter for gene 3 in clone L4 was isolated by digesting 
pPAL0411 with EcoRI and SnaBI. This released an 

35 approximately 415 base pair EcoRI (nucl 5858) to SnaBI 
(nucl 6273) fragment that represents most of the 5' 
region of the gene 3' promoter from clone L4 (the SnaBI 
recognition site is 2 base pairs downstream of the TATA 
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box) . This EcoRI - SnaBI fragment was used to replace 
the shorter EcoRI - SnaBI fragment removed from the first 
promoter subclone (pPAL04 22) , reconstructing a promoter 
fragment of approximately 550 base pairs. This plasmid 
is referred to as pPAL0421. This chimeric promoter 
fragment contains 415 base pairs of the promoter of gene 
three in clone L4, followed by approxitnately 99 
nucleotides of the first gene promoter/untranslated 
leader sequence . 

For construction of a pollen specific cassette 
vector, the following plasmids were first constructed. 
The first plasmid constructed contained the nos ter 
polyadenylation signal with a polylinker in frbnt of the 
nos ter. This was accomplished by first isolating from 
PRAJ221 the nos ter as a SstI - EcoRI fragment and this 
fragment was cloned in pGEM 42 using the SstI and EcoRI 
sites in the polylinker. This subclone is referred to as 
pPALOOl. To pPALOOl, a fragment coding for neomycin 
phosphotransferase (NPT II) derived from the plasmid 
pRAJl62 was added to it in the antisense orientation as 
follows: The plasmid pRAJ162 contains the NPT II gene 
from the transposon TN5 inserted as a Sail fragment and 
bounded by a polylinker in the plasmid pUC-9 and was 
obtained from the Plant Breeding Institute, Cambridge, 
U.K. pRAJlS2 was digested with Hindlll and Smal . The 
DNA fragment containing the NPT 'II gene was isolated by 
elution from an agarose gel. pPALOOl was digested with 
Hindi! I and Smal and the NPT II gene fragment was ligated 
into it. The resultant plasmid was called pPAL002 and 
had such orientation of restriction sites and the NPT II 
gene and nos ter as follows: Hindlll, PstI, Sail, 3' end 
NPT II coding sequence 5' end, Sail, BamHI, Smal, Kpnl , 
SstI, nos ter . EcoRI. pPAL002 was cut with Hindlll and 
the site made blunt ended by the use of Klenow fragment. 
pPAL0421 was digested with Hindi and pVUII, both of 
which leave blunt ends, and the promoter fragment was 
ligated into Hindlll cut blunt ended pPAL002. Plasmids 
were obtained that contained the promoter in both 



wo 96/40950 _ PCT/US96/08692 

- 132 - 



orientations relative to the nQs te£. One plasmid was 
chosen with the proper orientation (5' promoter/ 
antisense NPT Il/nos ter) and was named pPAL0419. 
PPAL0419 has the following DNA fragments: A small 
(approx. 130 bp) of pGEM 4Z that contains the SP6 
promoter, the 550 base pair chimearic promoter, the NPT 
II gene in the antisense orientation relative to the 
promoter, followed by the nos ter polyadenylation signal. 
This entire promoter/NPT Il/nas ter construct is 
excisable by EcoRI . In order to provide promoter 
sequences that could be utlized with additional gene 
constructs, the plasmid pPAL04l9 was digested with Sail. 
This digest removes the NPT II coding region and this 
Sail digested pPAL0419 was relegated giving rise to 
PPAL0402. PPAL0420 represents the promoter essential for 
pollen development and/or function followed by a 
polylinker for insertion of genes that has the following 
unique sites: Hindi, PstI, Sail, BamHI, Smal, Kpnl , 
SstI, followed by the nos t^r polyadenylation signal. 
The entire promoter/polylinker/nos ter construct can be 
conveniently excised as a single EcoRI fragment. The 
details of this construct is shown in Figure 7B . This 
plasmid was used for the construction of an additional 
promoter essential for pollen development and/or function 
in a binary transformation vector. The intact L4 clone 
in the lambda cloning vector was digested to completion 
with the restriction enzymes SstI and Hhal. The 
resultant fragments were separated by gel electrophoresis 
and a 2.65 Kb fragment that contains the promoter/first 
exon/intron/partial second exon region of gene three in 
clone L4 and corresponds to nucleotides 4565 to 7216 in 
the sequence of clone L4 was isolated. This fragment was 
made blunt ended with Klenow and cloned into the binary 
transformation vector PALlOOl previously described with 
reference to Figure 7a. PALlOOl was first cut with 
Hindu I and made blunt ended with Klenow. Clones 
containing this fragment ( promoter / firs t 
exon/intron/partial second exon) were recovered. A clone 
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was chosen that contained this fragment in the proper 
orientation such that the direction of transcription was 
towards the nos ter in PALIOOI. This vector was named 
PAL1421. This vector contains approximately 1.9 kb of 
upstream promoter region from the gene 3 in clone L4 
followed by the first exon, the complete intron and 15 
bases of the second exon of gene three followed by a 
polylinker containing the following unique sites: Sail, 
Xbal, BamHI, Smal, Kpnl, SstI, and finally the nos terti, 
polyadenylation signal. A varient of this vector was 
constructed bv digesting PAL1421 with EcoRI and replacing 
the promoter/ exon/intron/second exon/polyl inker/ nos tes; 
structure with the promoter/polylinker/nos ter structure 
from PPAL0420 using EcoRI such that a longer 5' promoter 
region is reconstructed in the binary transformation 
vector. The resultant vector was named PAL1423 . The 
outline of this construction is shown in Figure 7D. 
Example "^q 

In this example, a promoter essential for pollen 
development and/or function is used to synthesize the 
enzyme lamH specifically in pollen cells. The enzyme has 
activity that can cause the production of NAA from NAM, 
the substance NAA functioning as a plant hormone that is 
substantially toxic to developing pollen grains, while 
the precursor NAM being relatively non- toxic. For this 
example, the lamH gene was inserted into the vector 
PAL1423. The lamH gene was isolated from pPCV3ll as 
described in Figure 15 and cloned as a Sail fragment in 
the Sail site of PAL1423, creating PAL1426. This vector 
has the lamH gene (T-DNA gene 2) under the control of a 
promoter essential for pollen development and/or function 
from clone L4 in the sense orientation. PAL1426 was used 
to transform tobacco as outlined in Example 34. 
Example 3 9a 

In this example, constructs are made whereby the lamS 
(gene 1) or lamH (gene 2) are expressed from either the 
microspore-specific promoter Bpio or the anther-specific 
TA39. The four constructs are shown in Figure 23 
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(Plasmid pPHP5838, BplO:IamS), Figure 24 (pPHP5839, 
BplO:IamH) , Figure 25 (pPHP5840, TA39:IamS) and Figue 26 
(pPHP5841, TA39:IamH) . All four constructs are generated 
using as a backbone the plant transformation vector 
pALLTKREP (Boutilier, K,A. , Gines, M.-J,, DeMoor, J.M., 
Huang, B., Baszczynski, C.L., Iyer, V.N,, and Miki, B.L. 
Expression of the BnmNAP subfamily of napin genes 
coincides with the induction of Brassica microspore 
embryogenesis . 1994. Plant Molec . Biol, 26: 1711-1723). 
pALLTKREP was renamed at Pioneer and will be referred to 
elsewhere as PPHP1741, 

The relevent portions of pPHP1741 that are retained 
in the four Pioneer plasmids' include: (l) a wide host 
range bacterial vector, including a bacterially expressed 
kanamycin resistance gene as the selectable marker, (2) 
left and right T-DNA borders derived from Agrobacterium 
tumefaciens, and (3) between the T-DNA borders, a 
chimeric selectable marker gene for plant expression of 
resistance to kanamycin, comprising a CaMV 35S promoter 
operably linked to the NPTII open reading frame (ORF) and 
CaMV 19S 3' nontranslated region, which encodes signals 
requried for proper processing of the 3' end of the 
transcript. Note in the maps, the wide host range 
origins of replication are referred to as "OriV" and 
"OriT" (vegetative origin of replication and transfer 
origin of replication) ; the bacterial selectable marker 
is referred to as "S. F. KanR" ; the T-DNA left and right 
borders are referred to as "LB" and "RB", respectively; 
the CaMV 35S promoter is referred to as "35SPro"; and 
the CaMV 19S 3' region is referred to as "CT", 

In Figures 25 and 26, incorporated between the TA39 
promoter and the lAMS or lAMH genes is the nontranslated 
leader from Tobacco Mosaic Virus (TMV) , 0', (Gallie, 
D.R., Sleat, D.E., Watts, J.W. , Turner, P.C., and Wilson, 
T.M.A. 1987, The 5' -leader sequence of tobacco mosaic 
virus RNA enhances the expression of foreign gene 
transcripts in vitro and in vivo. Nucl. Acids Res. 15: 
3257-3273; Gallie, D.R., Sleat, D.E., Watts, J.W., 
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Turner, P.C., and Wilson, T.M.A.1987. A comparison of 
eukaruotic viral 5' -leader sequences as enhancers of mRNA 
expression in vivo. Nucl . Acids Res. 15: 8693-8711; 
Sleat, Hull, R. , Turner, P.C., and Wilson, T.M.A. 

5 1988. Studies on the mechanism of translational 
enhancement by the 5' -leader sequence of tobacco mosaic 
virus RNA. Eur. J. Biochem. 175: 75-86). 0' is our 
designation on the maps of Figures 25 and 26 for the TMV 
non translatad leader, which is known as "0'". 

10 The constructs also contain the 3' nontranslated 

region from either the nopaline synthase (nos) gene from 
Agrobacterium tumefaciens Ti plasmid pTiT3 7, or from the 
proteinase inhibitor II gene (PI -II) from Solanum 
tuberosum (potato) . Both are well-known in the art. The 

15 reference for the nos 3' region is: Sevan, M., Barnes, 
W.M., and Chilton, M.-D. 1983. Structure and 

transcription of the nopaline synthase gene region of T- 
DNA. Nucl. Acids Res. 11: 369-385. The reference for 
the PI-II 3' region is: An, G. , Mitra, A., Choi, H.K., 

20 Costa, M.A., An, K. , Thornburg, R.W., and Ryan, C.A. 

1989. Functional Analysis of the 3' Control Region of 
the Potato Wound- Inducible Proteinase Inhibitor II Gene. 
Plant Cell 1: 115-122. Note that in Figures 23-26, the 
nos 3' region is referred to as "NT" and the PI-II 3' 

25 region is referred to as "Pinll". 

Figures 27-30 list partial sequence information for 
plasmids pPHP5838, pPHP5839, pPHP5840 and pPHP5841 for 
which maps are provided in Figures 23-26, respectively. 
The nucleotide numbers indicated in Figures 27-3 0 are the 

30 same as the numbers on the maps of Figures 23-26. The 
sequences provided in Figures 27-30, extend from the left 
border region to the right border region of each plasmid. 
That part of the DNA sequence not included is common to 
all four plasmids and belongs to plasmid pPHP1741 

3 5 (pALLTKREP, described above) . 
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Example 4 0 

The vector PAL1426 was used to transform Brassica 
napus as described in Example 34 . 
Example 41 

In this example, the vector PAL1107 was used for the 
production of tissue-specific GUS (beta-glucuronidase) 
enzyme. The gene for this enzyme is available from 
Clonetech Laboratories, Palo Alto. CA. , U.S.A. The gene 
was inserted to PAL1107 as a BamHI - SstI fragment and 
was used to transform tobacco as described in Example 34. 
Plants produced had detectable GUS activity only in 
developing pollen cells, and not any other tissue tested. 
The application of a non-toxic analog of glucuronic acid 
to which has been conjugated a toxic molecule could be 
applied to these plants and cleavage of the toxic moiety 
from the glucuronic acid would occur only in pollen 
cells. This provides an example of an enzyme that could 
be used for the production, in a tissue-specific fashion, 
of a toxic substance from a non- toxic analog. One such 
analog that could be used is Glue -Camp, a glucuronic acid 
analog that is conjugated to chloramphenicol. When acted 
upon by glucoronidase , chloramphenicol is produced, 
inhibiting cellular growth and development. 
Example 4 2 

In this example, two isogenic plant lines (Al, A2) 
were produced that carried either the lamS or the lamH 
genes. Tobacco plants were transformed with PAL1426 
containing the lamH gene as in Example 39, producing the 
A2 line. The lamS gene described in Figure 16 was 
inserted as a Sail fragment into the vector PAL1423 in 
the sense orientation, giving rise to PAL1425. PAIil425 
was used to transform tobacco as described and tobacco 
plants were produced that carried PAL1425, These lines 
represented the Al lines. Tobacco plants that contained 
both PAL1426 and PAL1425 were selfed and homozygous Al. 
and A2 lines were selected. 
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Example 43 

In this example, PAL1426 (see Example 39) and PAL1425 
were used to transform Brassica napus . Plants lines 
homozygous for the Al and A2 genes were selected as in 
Example 42. 
Example 44 

Gene 1 (lamS) and gene 2 (lamH) constructs were 
introduced into different canola lines by transformation, 
as in example 34. Two different anther-specific 
promoters were used for each gene: TA39 from tobacco and 
BplO from Brassica constructed as described :in example 
39a and Figures 23-26. TO plants carrying each 
gene/promoter combination were recovered. The TO plants 
carrying gene 2 looked completely normal, and were 
completely male and female fertile. Some of the TO's 
carrying gene 1 were abnormal, showing huge flowers and 
not much pollen. The pollen from these plants was 
probably not fertile. In the crosses described above, 
the more normal looking gene 1 plants were generally 
used. Crosses were made between plants containing gene 
1 and gene 2. Eight Tl progeny which are PCR-positive 
for the presence of both gene 1 and gene 2 were 
identified. The following Tl combinations were obtained: 
TA39/gene 1 + TA39/gene 2 (2 plants); TA39/gene 1 + 
TA39/gene 2; TA39/gene 1 + TA39/gene 2; TA39/gene 1 + 
TA39/gene 2; BPlO/gene 1 + TA39/gene 2; B.PlO/gene 1 + 
TA39/gene 2 (2 plants) ; BPlO/gene 1 + BPlO/gene 2 (2 
plants); BPlO/gene 1 + BPlO/gene 2. The Tl progeny were 
male sterile and female fertile. The male sterile plants 
showed very reduced anthers and either lacked pollen or 
the pollen was brown colored. In some cases, the male 
sterile plants actually produced pods, but no seed was 
present. One of the Tl progenies only, when selfed, 
produced a limited amount of seed. Female fertility was 
confirmed by applying wild-type pollen on the plants 
which resulted in seed production. 
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It Is ClaimaH » 

1. A recombinant DNA molecule for use in preparing 
a male sterile plant of a pollen-bearing species that is 
capable of being transformed, said molecule comprising 

(A) a first DNA sequence coding for a first gene 
product that converts a first substance which 
is present in the plant into a second 
substance, wherein the first and second 
substances do not inhibit pollen formation or 
function, 

(B) a second DNA sequence coding for a second gene 
product that converts said second substance to 
a third product that inhibits pollen formation 
or function, 

(C) a first promoter that controls expression of 
said first DNA sequence in plants, 

(D) a second promoter that controls expression of 
said second DNA sequence in plants, and 

(E) a third DNA sequence that confers resistance to 
a selective agent . 

2. A recombinant DNA molecule according to claim 1, 
wherein said resistance is to a herbicide. 

3. A recombinant DNA molecule according to claim 2, 
wherein said herbicide is selected from the group 
consisting of glyphosate, glyphosinate and TBL. 

4. A recombinant DNA molecule according to claim 1, 
wherein at least one of said first and second promoters 
is specific to cells that are critical to pollen 
formation or function. 

5. A recombinant DNA molecule according to claim 4, 
wherein at least one of said first and second promoters 
is specific to anther cells. 

6. A recombinant DNA molecule according to claim 4, 
wherein at least one of said first and second promoters 
is specific to microspores. 

7. A recombinant DNA molecule according to claim 1, 
wherein said second promoter is an inducible promoter. 
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8. A recombinant DNA molecule according to claim 1, 
wherein said first DNA sequence is lamS and said second 
DNA sequence is lamH. 

^- A recombinant DNA molecule according to claim 1, 
5 wherein said second substance is 2 -amino-4 -methoxy- 
butanoic acid and said second expression product is 
methoxinine dehydrogenase or rhizobitoxine synthase. 

10. A recombinant DNA molecule according to claim 
1, wherein said second substance is a non- toxic analog of 

io glucuronic acid and said second expression product is 0- 
glucuronidase . 

11 . A recombinant DNA molecule according to claim 
1, wherein said second substance is napthalene acetamide 
or indole acetamide and said second expression product is 

15 indole acetamide hydrolase. 

12. A method of producing a male sterile plant, 
comprising the steps of 

(A) introducing a recombinant molecule according to 
claim 1 into the genome of a pollen producing plant which 

2 0 is capable of being genetically transformed; 
and 

(B) growing said pollen producing plant under 
conditions such that male sterility is achieved as a 
result of the expression of said first DNA sequence and 

25 said second DNA sequence, providing said male sterile 
plant . 

13. A method according to claim 12, further 
comprising, after step (B) , the steps of (C) crossing 
said male sterile plant with an fertile, isogenic plant 
to obtain seed, then (D) growing out said seed to produce 
progeny plants, and (E) selecting male sterile plants 
from among said progeny by exposure to said selective 
agent . 

14 . A method of producing hybrid seed from a male 
35 sterile plant, comprising the steps of 

(A) introducing a first recombinant molecule according 
to claim 1 into the genome of a pollen producing 
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plant which is capable of being genetically 
transformed; 

(B) growing said pollen producing plant under conditions 
such that male sterility is achieved as a result of 
the expression of said first DNA sequence and said 
second DNA sequence, yielding a male sterile plant; 
and 

(C) crossing said male sterile plant with pollen from a 
male fertile line, said pollen having integrated 
into its genome a second recombinant DNA molecule 
comprising a selective marker gene and a .promoter 
regulating expression of said selective marker gene 
in plants. 

15 . A method according to claim 14 , wherein said 
second recombinant DNA molecule further comprises (i) a 
third DNA sequence and (ii) a third promoter that causes 
transcription of said third DNA sequence in plants, 
wherein said third DNA sequence is transcribed as an 
antisense RNA or encodes a gene product that negates the 
effect of said first DNA sequence or said second DNA 
sequence, such that hybrid seed with restored fertility 
is produced by said male sterile plant, and said third 
promoter causes said transcription such that expression 
of said third. DNA sequence overlaps in time with 
expression of said first DNA sequence or said second DNA 
sequence . 

16. A method according to claim 15, wherein said 
third DNA sequence has an antisense RNA transcript. 

17. A method according to claim 15, wherein said 
third DNA sequence is a repressor gene. 

18. A method according to claim 15, wherein said 
third DNA sequence encodes a gene product that is a 
ribozyme . 

19. A method according to claim 15, wherein said 
selective marker genes imparts herbicide resistance. 

20. A method according to claim 15, wherein said 
third DNA sequence encodes a gene product that is a 
dominant negative transact ivator. 
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21. A method according to claim 15, wherein said 
third DNA sequence encodes a gene product that - is a 
transdominator . 

22. Hybrid seed produced by a method according to 
claim 15. 

23. A plant cell comprising elements (A) through (E) 
of a DNA molecule according to claim 1, 

24. A differentiated plant comprising a cell 
according to claim 23. 
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FIG. 27a 

^^^^ CTGAT GGGCTGCCTG TATCGAGTGG 

2 751 TGATTTTGTG CCGAGCTGCC GGTCGGGGAG CTGTTGGCTG GCTGGTGGCA 
2801 GGATATATTG TGGTGTAAAC AAATTGACGC TTAGACAACT TAATAACACA 
2851 TTGCGGACGT TTTTAATGTA CTGCCATTCG CCATTCAGGC TGCGCAACTG 
2901 TTGGGAAGGG CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA 
2951 AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC 
3001 CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCGaattccc "aGATCTAGT 

3 051 AACATAGATG ACACCGCGCG CGATAATTTA TCCTAGTTTG CGCGCTATAT 
3101 TTTGTTTTCT ATCGCGTATT AAATGTATAA TTGCGGGACT CTAATCATAA 
3151 AAACCCATCT CATAAATAAC GTCATGCATT ACATGTTAAT TATTACATGC 
3 2 01 TTAACGTAAT TCAACAGAAA TTATATGATA ATCATCGCAA GACCGGCAAC 
3251 AGGATTCAAT CTTAAGAAAC TTTATTGCCA AATGTTTGAA CGATCTTGCA 
3301 CGTTGAGCTC GGTACCCGAT CCGTAGATTA ATTTCTATTG CGGTAGTTAT 
3 351 ATCTCTTCCA AGAGTGTTCG AGAGGATTGT CCTTTGCCAA AATACCTCCA 
34 01 CAATTGTGGA TAATTGCACA GACGGCGTTA CACGCGGTCT GAATAGCGCC 
3451 CTCCACCCAT CCACCGGTGA AGGAACAACT GCAACCCGCC AAGTAAACTC 
3501 CGGTATCATT AGTCATGTCC AGCGCTTGAA AGAAAAGTTC TTCAGAATAA 
3551 AAATCCTCGC CACGCCGGTT GAGTTTGAAA GCTCCCCCGG CATTCTCGTC 
3 601 TGTAAGCCAA TCATGTTGAA CAACATTTTG GTCGTAATCA GCGCAGGCGG 
3651 GAACTAGATG CTGGGCAAAC GCCGGGAAAG ATCTCGAAAT TGCGTCCCGC 
3701 AGCAGACAGA ATCGCTCTTT TTTGTCGGGA ACCGCCAACA GCTTGTGGGA 
3751 GTCGTCCTCC CATGTATAAC TGATGAGCAC CAGACCTTTA CCATTCGGgT 
3801 CCTGgGGCTC ATAGTCCAGG CAGTACACTG CTTTTGCGAT CCCGTCCATG 
3851 AGGACACAGG ACGGGAGGAT ATGGTCTAAC CAAAATTTTC GTTCAGTGAC 
3901 CAGGAAGAGT TTTGACGAGC CTGTCATATG GCTGTTATCA ACCGCTTGGT 
3951 TCACTCGTGC ACGAAAAATG GTGGTATCGC ATGTCAGACA ATGCCTGAGT 
4001 TGGATATTTG CGAGTCCAGA TGTAACCACC ACCTTATCAT AAAGTTCAGA 
4051 TATCCCGCTC TTAAGCCTTA TCTTTATTTT TGTCTTTTCT TTCTCAATCG 
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CTTTCTCGTA 


5101 


AGGCGGCATT 


TGCAGGGTTC 


GGGAGCGGGC AGTCTTAGAG ATTGAAATGG 


5151 


CCACGAAATG 


ACTAAGATCG 


CTGGATGCAC ATCTTTTACC 


GCTCAGCAAA 


5201 


AACAGGGTTT 


GGAGAAGTTG 


TGCTTCGGAG AAATCCATAC AAAATGGTGC 


5251 


AAGGTCGATG 


GCAACGAGAC 


CACTCACTCT TGCTCGCGCC 


CAAGGTTCCG 


5301 


ATTCAAGTAT 


CCGCCCCAGA ATTTCCTTGC CAACATAGAT 


GTAAGCGGAG 


5351 


AGGGCTGCTA 


CCTTCTCACC 


AGCTGAGATC TCGGGAAAGC GACCGTCGGC 


5401 


CAGCCTTTTG 


CAAGCTAACC 


CAGCGCTGCA CTCGCCGGAG ATTTGAGTAA 


5451 


TCCTCCTTCC 


CCTAGAAGCT TCACGTTCTG AGAAGGCATC 


GGAAACCCTG 
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FIG. 27c 

5501 CGGTCCAATT CATCAGCCTT ATCGACCATT ATCAGATCCA CCATTTTGGT 

5551 AGAGAAATGA TCGCACTGGT TATCAAGGAG AGCTGAAGCT GACATgGtGG 

5601 ATCCTCTAGA GTCTATGTGT TTTTGTGGGG TATTAATGTT TTATTTTTAT 

5651 TTGGTTATGT TTCGGGGTTT GAATTTCGAG TTGTGAAAAT GCTTGAGGAC 

5701 GTTTTTATAG GGTTTTTTAA GACTGCTTTT TCTGATTTTC TTTGGCATCT 

5751 CCTCTCATTT CGCTATTCAT TTCCATATTT GGGAACATTG AAGTTTAACC 

5 SOI TCTCGCTATT ATTAGTGTTT TTAGTTTCTC TTAACATTTT TCTCCCTTTT 

5j51 CTAATTTTTG GCTTTGGTTT CCATGTCTTT GAGAATTTGG CTATTTTGGA 

5901 TTTTAAAAGA ATACTCTGAG AGAGAAAAAT CTATTAGTCA CGTGAATAAG 

5951 TTATAGACAT AAACTTGAAT GTATTTCAAG AAGTTCACAT GCAAACTGAT 

6001 AAATTTACAG AACTATTATT ATTATTATTA TTATTATTAT TAGTTTTTTT 

6051 GGTTAAACAC TCTTTTTTTT TGTCAGACCT GCAGGCATGC AAGCTTGGCG 

6101 TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 

6151 TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 

6201 AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC 

6251 CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC 

6301 GGGGAGAGGC GGTTTGCGTA TTGGGCGCTA TCGGGGGATC TATAGGGACT 

6351 TTAGGTGATC TGGATTTTAG TACTGGATTT TGGTTTTAGG AATTAGAAAT 

6401 TTTATTGATA GAAGTATTTT ACAAATACAA ATACATACTA AGGGTTTCTT 

6451 ATATGCTCAA CACGTGAGCG AAACCCTATA AGAACCCTAA TTCCCTTATC 

6501 TGGGAACTAC TCACACATVT ATTATGGAGA AAATAGAGAG AGATAGATTT 

6551 GTAGAGAGAG ACTGGTGATT TCGAACCCCA GAGTCCCGCT CAGAAGAACT 

6601 CGTCAAGAAG GCGATAGAAG GCGATGCGCT GCGAATCGGG AGCGGCGATA 

6651 CCGTAAAGCA CGAGGAAGCG GTCAGCCCAT TCGCCGCCAA GCTCTTCAGC 

6701 AATATCACGG GTAGCCAACG CTATGTCCTG ATAGCGGTCC CCCACACCCA 

6751 GCCGGCCACA GTCGATGAAT CCAGAAAAGC GGCCATTTTC CACCATGATA 

6801 TTCGGCAAGC AGGCATCGCC ATGGGTCACG ACGAGATCCT CCCCGTCGGG 

6851 CATGCCCGCC TTCAGCCTCC CGAACAGTTC GGCTGGCCCG AGCCCCTCAT 
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c 1 

r I 


27d 




6901 


GCTCTTCGTC 


CAGATCATCC 


TGATCGACAA 


GACCGGCTTC CATCCGAGTA 


6951 


CGTGCTCGCT 


CGATGCGATG 


TTTCGCTTGG 


TGGTCGAATG GGCAGGTAGC 


7001 


CGGATCAAGC 


GTATGCAGCC 


GCCGCATTGC 


ATCAGCCATG ATGGATACTT 


7051 


TCTCGGCAGG 


AGCAAGGTGA 


GATGACAGGA 


GATCCTGCCC CGGCACTTCG 


7101 


CCCAATAGCA 


GCCAGTCCCT 


TCCCGCTTCA 


GTGACAACGT CGAGCACAGC 


7151 


TGCGCAAGGA 


ACGCCCGTCG 


TGGCCAGCCA 


CGATAGCCGC GCTGCCTCGT 


7201 


CCTGCAGTTZ 


ATTCAGGGCA 


CCGGACAGGT 


CGGTCTTGAC AAAAAGAACC 


7251 


GGGCGCCCCT 


GCGCTGACAG 


CCGGAACACG 


GCGGCATCAG AGCAGCCGAT 


7301 


TGTCTGTTGT 


GCCCAGTCAT 


AGCCGAATAG 


CCTCTCCACC CAAGCGGCCG 


7351 


GAGAACCTGC 


GTGCAATCCA 


TCTTGTTCAA 


TCATGCGAAA GGATCCTCAG 


7401 


CCTGTCTCTT 


GATCTCCTCT 


CCAAATGAAA 


TGAACTTCCT TATATAGAGG 


7451 


AAGGGTCTTG 


CGAAGGATAG 


TGGGATTGTG 


CGTCATCCCT TACGTCAGTG 


7501 


GAGATATCAC 


ATCAATCCAC 


TTGCTTTGAA 


GACGTGGTTG GAACGTCTTC 


7551 


TTTTTCCACG 


ATGCTCCTCG 


TGGGTGGGGG 


TCCATCTTTG GGACCACTGT 


7601 


CGGCAGAGGC 


ATCTTGAACG 


ATAGCCTTTC 


CTTTATCGCA ATGATGGCAT 


7651 


TTGTAGGTGC 


CACCTTCCTT 


TTCTACTGTC 


CTTTTGATGA AGTGACAGAT 


7701 


AGCTGGGCAA 


TGGAATCCGA 


GGAGGTTTCC 


CGATATTACC CTTTGTTGAA 


7751 


AAGTCTCAAT 


AGCCCTTTGG 


TCTTCTGAGA 


CTGTATCTTT GATATTCTTG 


7801 


GAGTAGACGA 


GAGTGTCGTG 


CTCCACCATG 


TTGACGAAGA TTTTCTTCTT 


7851 


GTCATTGAGT 


CGTAAAAGAC 


TCTGTATGAA 


CTGTTCGCCA GTCTTCACGG 


7901 


CGAGTTCTGT 


TAGATCCTCG 


ATCTGAATTT 


TTGACTGCAT GGCCTTTGAT 


7951 


TCAGTAGGAA 


CTACTTTCTT 


AGAGACTCCA 


ATCTCTATTA CTTGCCTTGG 


8001 


TTTATGAAGC 


AAGCCTTGAA 


TCGTCCATAC 


TGGAATAGTA CTTCTGATCT 


8051 


TGAGAAATAT 


ATCTTTCTCT 


GTGTTCTTGA 


TGCAGTTAGT CCTGAATCTT 


8101 


TTGACTGCAT 


CTTTAACCTT 


CTTGGGAAGG 


TATTTGATCT CCTGGAGATT 


8151 


ATTACTCGGG 


TAGATCGTCT 


TGATGAGACC 


TGCCGCGTAG GCCTCTCTAA 


8201 


CCATCTGTGG 


GTCAGCATTC 


TTTCTGAAAT 


TGAAGAGCCT AATCTTCTCA 


8251 


TTATCGGTGG 


TGAACATGGT 


ATCGTCACCT 


TCTCCGTCGA ACTTTCTTCC 
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FIG. 278 



8301 


TAGATCGTAG AGATAGAGAA AGTCGTCCAT 


GGTGATCTCC 


GGGGCAAAGG 


8351 


AGATCCTCTA GCTGATAGTG ACCTTAGGCG 


ACTTTTGAAC 


GCGCAATAAT 


8401 


GGTTTCTGAC GTATGTGCTT AGCTCATTAA 


ACTCCAGAAA 


CCCGCGGCTG 


8451 


AGTGGCTCCT TCAATCGTTG CGGTTCTGTC 


AGTTCCAAAC 


GTAAAACGGC 


8501 


TTGTCCCGCG TCATCGGCGG GGGTCATAAC 


GTGACTCCCT 


TAATTCTCCG 


8551 


CTCATGATCA GCGGCTGAGT GGCTCCTTCA ACGTTGCGGT 


TCTGTCAGTT 


8601 


CCAAACGTAA AACGGCTTGT CCCGCGTCAT 


CGGCGGGGGT 


CATAACGTGA 


8651 


CTCCCTTAAT TCTCCGCTCA TGATCAGATT 


GTCGTTTCCC 


GCCTTCAGTT 


8701 


TAAACTATCA GTGTTTGACA GGATATATTG 


GCGGGTAAAC 


CTAAGAGAAA 


8751 


AGAGCGTTTA TTAGAATAAT CGGATATTTA 


AAAGGGCGTG 


AAAAGGTTTA 


8801 


TCCGTTCGTC CATTTGTATG TGCATGCCAA CCACAGG 
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FIG. 28a 

^"^^^ • CTGAT GGGCTGCCTG TATCGAGTGG 

2751 TGATTTTGTG CCGAGCTGCC GGTCGGGGAG CTGTTGGCTG GCTGGTGGCA 

2 801 GGATATATTG TGGTGTAAAC AAATTGACGC TTAGACAACT TAATAACACA 

2 851 TTGCGGACGT TTTTAATGTA CTGCCATTCG CCATTCAGGC TGCGCAACTG 
2901 TTGGGAAGGG CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA 
2951 AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC 

3 001 CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCGaattccc caGATCTAGT 
3 051 AACATAGATG ACACCGCGCG CGATAATTTA TCCTAGTTTG CGCGCTATAT 
3101 TTTGTTTTCT ATCGCGTATT AAATGTATAA TTGCGGGACT CTAATCATAA 
3151 AAACCCATCT CATAAATAAC GTCATGCATT ACATGTTAAT TATTACATGC 
3 201 TTAACGTAAT TCAACAGAAA TTATATGATA ATCATCGCAA GACCGGCAAC 
3 251 AGGATTCAAT CTTAAGAAAC TTTATTGCCA AATGTTTGAA CGATCTTGCA 
3 3 01 CGTTgAgcTc aCGTGGCATA CAGCTAATTA AAAGCATCGG GAAAGGAAGG 
3 3 51 AAAATTTATG GCTTTTTCTA ATGCTGCCCC GATTGCTAAC AGACGGTGGT 
34 01 CTGACCCCGC TAATCCATCA ATTTCCATTC CAACAGGCAA GCGATCAGGT 
34 51 GTAAGGCAGG CAGGAAGGCT CAACCCAGGT AGGCCTGCGT TGCTGCTTGG 
3 501 GTCCACATTT CGCACGTAGA TCTTGAAAGT GTTCATCATT GAGCCATTGT 
3 551 GGATGACTGA CGACTCCTGA CCTATGGCTT TGGCCGCTAA GGGTGCAGTT 
3 601 GGGAAAAGGA TTGCATCTAA CTGATAGAGT CTGAAGTAAT TCCGATAAGT 
3651 GGCCTGGAGC CTTGGCCTGA AGGATTGACG CGCCAGTTCA TATTCATCGT 
3701 TGGAAATTTG ATGCCCATCA ATTTGCGCAC TGACAATGTT CGCTACATCG 
3751 GGGCTACGtA TaCCTTTGAT AACGTCAGAA AAAGAAACTG TTCCCACAAA 
3801 ATCGTCGAGA TACTTTTTTA GAGCGTGTGG AAATTCGTAA AGCGCAATTG 
3851 GCAAACTTGC CCCACTATTC AGTTCCTCTA GGTGGGGGAT GTCGGCTTCA 
3 901 ACAAAGGTTA CGCCTCTGTT GGCTAGCAAG CGAATCGTCG TTTCAGCTGC 

3 951 GAAGGCCACA TCAGCATCAA GGTCATCGTA AAAGTAGGTA GTGGGGACGC 
4001 CGATCCGAAG CCCCTTCAGC GGCATCGGTC AAATTTTCGC CGACCGTCCG 

4 051 GAAATCACCT CGTCGAGGAT TATAACATCG GCTACGCACT GCGCTATGAT 
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FIG. 28b 

4101 TCCGGCGGTG TCCCGGGTGG GGCTGACCGG TATTATCCGA TCTCTTGGAT 

4151 ATCGAGCAAG CGTCGGTCGA AATCCTACTA CGCCACACAG GGCTGCGGGT 

42 01 AGGCGAACAG ATGCACCGGT ATCGGTGCCT ATGCCGCCTA ACATCAATCG 
4251 GCTTGCCACC GCAGCAGCCA CACCACCGCT TGAGCCTCCT GGTATCAGAC 

43 01 TTGGATTCCA CGGGTTCCGC ACCGCACCGG TGGCATAGTT GTTGCTCGTA 

43 51 ATTCCAAACG ATAACTCATG CATGTTTCCC GAGGCACCCG GCAGTGCTCC 

44 01 AGCTGAAAAA AGTCTTTCTG CGACGCGGGA TGGTATCTTT GGCAAGTGGT 
4451 TTATCAGCGC CGGAGTAGCA GCGCTTGTAG GAAATATGCC GGTCGCGATG 
4501 TTCGCCTTAA AACAGAGTGG AATGCCGCAA AGACCTAATC CGGCGTTTCC 
4551 ATGACGATCA ATTTTTTTGG CGCTTCGCCG CAAGCCATCC CAGTCTGTAG 
4 601 CCAGAAGGGC ATTTAATGGT TTTGCAGCTT GGCAACGCGC TATCAGAGTT 
4651 TCTACTAGTT CTAAGCAGGA GTAGTCTTTC CGTCTCAGGC GTTCTAGGGT 
4701 TTGTGCTAAC GAGGTAATGG GCACCATggT ggatccTCTA GAGTCTATGT 
4751 GTTTTTGTGG GGTATTAATG TTTTATTTTT ATTTGGTTAT GTTTCGGGGT 
4801 TTGAATTTCG AGTTGTGAAA ATGCTTGAGG ACGTTTTTAT AGGGTTTTTT 
4851 AAGACTGCTT TTTCTGATTT TCTTTGGCAT CTCCTCTCAT TTCGCTATTC 
4901 ATTTCCATAT TTGGGAACAT TGAAGTTTAA CCTCTCGCTA TTATTAGTGT 
4951 TTTTAGTTTC TCTTAACATT TTTCTCCCTT TTCTAATTTT TGGCTTTGGT 
5001 TTCCATGTCT TTGAGAATTT GGCTATTTTG GATTTTAAAA GAATACTCTG 
5051 AGAGAGAAAA ATCTATTAGT CACGTGAATA AGTTATAGAC ATAAACTTGA 
5101 ATGTATTTCA AGAAGTTCAC ATGCAAACTG ATAAATTTAC AGAACTATTA 
5151 TTATTATTAT TATTATTATT ATTAGTTTTT TTGGTTAAAC ACTCTTTTTT 
5201 TTTGTCAGAC CTGCAGGCAT GCAAGCTTGG CGTAATCATG GTCATAGCTG 
5251 TTTCCTGTGT GAAATTGTTA TCCGCTCACA ATTCCACACA ACATACGAGC 
5301 CGGAAGCATA AAGTGTAAAG CCTGGGGTGC CTAATGAGTG AGCTAACTCA 
5351 CATTAATTGC GTTGCGCTCA CTGCCCCCTT TCCAGTCGGG AAACCTGTCG 
5401 TGCCAGCTGC ATTAATGAAT CGGCCAACGC CCGGGGAGAG GCGGTTTCCG 
5451 TATTGCGCGC TATCGGGGGA TCTATACGGA CTTTAGCTGA TCTGGATTTT 
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FIG. 28c 



5501 


AGTACTGGAT TTTGGTTTTA GGAATTAGAA ATTTTATTGA TAGAAGTATT 


5551 


TTACAAATAC 


AAATACATAC 


TAAGGGTTTC TTATATGCTC 


AACACGTGAG 


5601 


CGAAACCCTA TAAGAACCCT AATTCCCTTA TCTGGGAACT ACTCACACAT 


5651 


VTATTATGGA 


GAAAATAGAG 


AGAGATAGAT TTGTAGAGAG 


AGACTGGTGA 


5701 


TTTCGAACCC 


CAGAGTCCCG 


CTCAGAAGAA CTCGTCAAGA 


AGGCGATAGA 


5751 


AGGCGATGCG 


CTGCGAATCG 


GGAGCGGCGA TACCGTAAAG CACGAGGAAG 


5801 


CGGTCAGCCC 


ATTCGCCGCC 


AAGCTCTTCA GCAATATCAC: 


GGGTAGCCAA 


5851 


CGCTATGTCC 


TGATAGCGGT 


CCGCCACACC CAGCCGGCCA 


CAGTCGATGA 


5901 


ATCCAGAAAA 


GCGGCCATTT 


TCCACCATGA TATTCGGCAA 


GCAGGCATCG 


5951 


CCATGGGTCA 


CGACGAGATC 


CTCGCCGTCG GGCATGCGCG 


CCTTGAGCCT 


6001 


GGCGAACAGT 


TCGGCTGGCG 


CGAGCCCCTG ATGCTCTTCG 


TCCAGATCAT 


6051 


CCTGATCGAC 


AAGACCGGCT 


TCCATCCGAG TACGTGCTCG 


CTCGATGCGA 


6101 


TGTTTCGCTT 


GGTGGTCGAA TGGGCAGGTA GCCGGATCAA 


GCGTATGCAG 


6151 


CCGCCGCATT 


GCATCAGCCA 


TGATGGATAC TTTCTCGGCA 


GGAGCAAGGT 


6201 


GAGATGACAG 


GAGATCCTGC 


CCCGGCACTT CGCCCAATAG 


CAGCCAGTCC 


6251 


CTTCCCGCTT 


CAGTGACAAC 


GTCGAGCACA GCTGCGCAAG 


GAACGCCCGT 


6301 


CGTGGCCAGC 


CACGATAGCC 


GCGCTGCCTC GTCCTGCAGT 


TCATTCAGGG 


6351 


CACCGGACAG 


GTCGGTCTTG 


ACAAAAAGAA CCGGGCGCCC 


CTGCGCTGAC 


6401 


AGCCGGAACA 


CGGCGGCATC 


AGAGCAGCCG ATTGTCTGTT 


GTGCCCAGTC 


6451 


ATAGCCGAAT 


AGCCTCTCCA 


CCCAAGCGGC CGGAGAACCT 


GCGTGCAATC 


6501 


CATCTTGTTC 


AATCATGCGA 


AAGGATCCTC AGCCTGTCTC 


TTGATCTCCT 


6551 


CTCCAAATGA AATGAACTTC 


CTTATATAGA GGAAGGGTCT 


TGCGAAGGAT 


6601 


AGTGGGATTG 


TGCGTCATCC 


CTTACGTCAG TGGAGATATC 


ACATCAATCC 


6651 


ACTTGCTTTG 


AAGACGTGGT 


TGGAACGTCT TCTTTTTCCA 


CGATGCTCCT 


6701 


CGTGGGTGGG 


GGTCCATCTT 


TGGGACCACT GTCGGCAGAG 


GCATCTTGAA 


6751 


CGATAGCCTT 


TCCTTTATCG 


CAATGATGGC ATTTGTAGGT 


GCCACCTTCC 


6801 


TTTTCTACTG 


TccrrrTGAT 


GAAGTGACAG ATAGCTGGGC 


AATGGAATCC 


6851 


GAGGAGGTTT 


CCCGATATTA 


CCCTTTGTTG AAAAGTCTCA 


ATAGCCCTTT 
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FIG. 28d 



6901 


GGTCTTCTGA 


GACTGTATCT 


TTGATATTCT 


TGGAGTAGAC 


GAGAGTGTCG 


6951 


TGCTCCACCA TGTTGACGAA 


GATTTTCTTC 


TTGTCATTGA 


GTCGTAAAAG 


7001 


ACTCTGTATG 


AACTGTTCGC 


CAGTCTTCAC 


GGCGAGTTCT 


GTTAGATCCT 


7051 


CGATCTGAAT 


TTTTGACTCC 


ATGGCCTTTG 


ATTCAGTAGG 


AACTACTTTC 


7101 


TTAGAGACTC 


CAATCTCTAT 


TACTTGCCTT 


GGTTTATGAA 


GCAAGCCTTG 


7151 


AATCGTCCAT 


ACTGGAATAG 


TACTTCTGAT 


CTTGAGAAAT 


AT AT CTTT CT 


7201 


CTGTGTTCTT 


GATGCAGTTA GTCCTGAATC 


TTTTGACTGC 


ATCTTTAACC 


7251 


TTCTTGGGAA 


GGTATTTGAT 


CTCCTGGAGA 


TTATTACTCG 


GGTAGATCGT 


7301 


CTTGATGAGA 


CCTGCCGCGT 


AGGCCTCTCT 


AACCATCTGT 


GGGTCAGCAT 


7351 


TCTTTCTGAA 


ATTGAAGAGG 


CTAATCTTCT 


CATTATCGGT 


GGTGAACATG 


7401 


GTATCGTCAC 


CTTCTCCGTC 


GAACTTTCTT 


CCTAGATCGT 


AGAGATAGAG 


7451 


AAAGTCGTCC 


ATGGTGATCT 


CCGGGGCAAA 


GGAGATCCTC 


TAGCTGATAG 


7501 


TGACCTTAGG 


CGACTTTTGA ACGCGCAATA 


ATGGTTTCTG 


ACGTATGTGC 


7551 


TTAGCTCATT 


AAACTCCAGA 


AACCCGCGGC 


TGAGTGGCTC 


CTTCAATCGT 


7601 


TGCGGTTCTG 


TCAGTTCCAA ACGTAAAACG 


GCTTGTCCCG 


CGTCATCGGC 


7651 


GGGGGTCATA 


ACGTGACTCC 


CTTAATTCTC 


CGCTCATGAT 


CAGCGGCTGA 


7701 


GTGGCTCCTT 


CAACGTTGCG 


GTTCTGTCAG 


TTCCAAACGT 


AAAACGGCTT 


7751 


GTCCCGCGTC 


ATCGGCGGGG 


GTCATAACGT 


GACTCCCTTA 


ATTCTCCGCT 


7801 


CATGATCAGA TTGTCGTTTC 


CCGCCTTCAG 


TTTAAACTAT 


CAGTGTTTGA 


7851 


CAGGATATAT TGGCGGGTAA ACCTAAGAGA AAAGAGCGTT 


TATTAGAATA 


7901 


ATCGGATATT 


TAAAAGGGCG 


TGAAAAGGTT 


TATCCGTTCG 


TCCATTTGTA 


7951 


TGTGCATGCC AACCACAGG 
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FIG. 29a 

^^^^ CTGAT GGGCTGCCTG TATCGAGTGG 

2751 TGATTTTGTG CCGAGCTGCC GGTCGGGGAG CTGTTGGCTG GCTGGTGGCA 
2801 GGATATATTG TGGTGTAAAC AAATTGACGC TTAGACAACT TAATAACACA 
2851 TTGCGGACGT TTTTAATGTA CTGCCATTCG CCATTCAGGC TGCGCAACTG 
2901 TTGGGAAGGG CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA 
2951 AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC 
3 001 CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCGAATTCGC AAAACACACC 
3 051 TAGACTAGAT TTGTTTTGCT AACCCAATTG ATATTAATTA TATATGATTA 
3101 ATATTTATAT GTATATGGAt TTGGTTAATg AAATGCATCT GGTTCATCAA 
3151 AGAATTATAA AGACACGTGA CATTCATTTA GGATAAGAAA TATGGATGAT 
3201 CTCTTTCTCt tTTATTCAGA TAAcTAGTAA TTACACATAA CACACAACTT 
3251 TGATGCCCAC ATTATAGTGA TTAGCATGTC ACTATGTGTG CATCCTTTTA 
3301 TTTCATACAT TAATTAAgTT GGCCAATCCA GAAGATGGAC AAGTCTAGGt 
3351 TGGGTACCGA GCTCGGTACC CGATCCGTAG ATTAATTTCT ATTGCGGTAG 
3401 TTATATCTCT TCCAAGAGTG TTCGAGAGGA TTGTCCTTTG CCAAAATACC 
3451 TCCACAATTG TGGATAATTG CACAGACGGC GTTACACGCG GTCTGAATAG 

3 501 CGCCCTCCAC CCATCCACCG GTGAAGGAAC AACTGCAACC CGCCAAGTAA 
3551 ACTCCGGTAT CATTAGTCAT GTCCAGCGCT TGAAAGAAAA GTTCTTCAGA 
3601 ATAAAAATCC TCGCCACGCC GGTTGAGTTT GAAAGCTCCC CCGGCATTCT 
3651 CGTCTGTAAG CCAATCATGT TGAACAACAT TTTGGTCGTA ATCAGCGCAC 
3701 GCGGGAACTA GATGCTGGGC AAACGCCGGG AAAGATCTCC AAATTGCGTC 
3751 CCGCAGCAGA CAGAATCGCT CTTTTTTGTC GGGAACCGCC AACAGCTTGT 
3801 GGGAGTCGTC CTCCCATGTA TAACTGATGA GCACCAGACC TTTACCATTC 
3851 GGgTCCTGgG GCTCATAGTC CAGGCAGTAC ACTGCTTTTG CGATCCCGTC 
3901 CATGAGGACA CAGGACGGGA GGATATGGTC TAACCAAAAT TTTCGTTCAG 
3951 TCAGCAGGAA GAGTTTTGAC GAGCCTGTCA TATGGCTGTT ATCAACCGCT 
4001 TGGTTCACTG GTGCACGAAA AATGGTGGTA TCGCATGTCA GACAATGCCT 

4 051 GAGTTGGATA TTTGCGAGTC CAGATGTAAC CACCACCTTA TCATAAACTT 
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FIG. 29b 

4101 CAGATATCCC GCTCTTAAGC CTTATCTTTA TTTTTGTCTT TTCTTTCTCA 
4151 ATCGCCCTGA CTTGAACATG GCGTATACGC TGGCTTACAG ACACACCGTT 
4 2 01 AACCACTTGA GTGGCTATTC GACGTGGAAG TTCTGAGATT CCTTCAGAGC 
4 2 51 ACATCCGCTG ATTTTCTTCA TATCCGTTTA TGACCAAGCG AAGGATCTCA 
4 3 01 ATAAACCCGC TTTCAAAAAC TGGACCAAAC CCGCCAGATC CTATTCCCAT 
4 351 TAGCTTGAAT AGGTCCCAAT CATGAGGGAA ACTCCATG-^T TCACCACCAG 
44 01 GAGGATGCGT GCCCAGAAAG ATCCTCTCTA TCGCTGAAUA GAAGGACTCC 
44 51 CTCCCGAAAC GGTTCAGCCA AGTTTGCCAG GAGTCATGAG CCCGCCTAAT 
4 501 GTCTCCTGAT TTCAAGGCTT GAGTAATAGC AACAGGCGAA GCCAACACAA 
4 551 TATCTCCCTC ATGGAAACCG TCCTTCAAGA ACGCACGCCA ACCGCTGTAA 
4 601 ACGCGATGGA ACAGCTTCGG TGGCTGCTGC CCGGCTTTCC ACATGTATCC 
4 651 GAGGCCTTGG TAGACCAAGT TAGTGTCGAC TGTGCCGGGA TTTGGGAACG 
4701 GCCTCATCGA AGACAGGCCG TACCGCTCGA GGAAGAAAAA CAAGCACGAT 
4751 GCAGCAGGAG GAAATCGCAT CGCCCCCATT TCGGCCACCA CGCTGGGAGC 
4801 GTCCTTGAAA GCATGTGACC AAAGCTTGCC TCCAACCCGA TCACTTGCTT 
4 851 CATATATTGT AACATCGTCT ACACCAGCAT GAAGCAGTTC GCTTGCCACC 
4901 ACGAGTCCGG AAATGCCAGC GCCAATGATC GCCACTTTTG GCTTAGGAAC 
4 951 ATCTTCCGGA AAGAAGCCGA TCCGTCCACT ATCGGAACAT TGGTCAAAAA 
5001 ACGATCTGTA GTCATAGAGC AAGTCAATTG TTGGAAAGGA ACCTGCCGAA 
5051 GCCTTCAGCA TCAGGTTTCG ACCATAAGCT ACCATGTCAA ATGGTACGGC 
5101 CTCTTCAAGG GTCAGGGTAA ACCCGGTAAC GCGTTTCGTC GTGCCTTTCT 
5151 CGTAAGGCGG CATTTGCAGG GTTCGGGAGC GGGCAGTCTT AGAGATTGAA 
5201 ATGGCCACGA AATGACTAAG ATCGCTGGAT GCACATCTTT TACCGCTCAG 
5251 CAAAAACAGG GTTTGGAGAA GTTGTGCTTC GGAGAAATCC ATACAAAATG 
5301 GTGCAAGGTC GATGGCAACG AGACCACTCA CTCTTGCTCG CGCCCAAGGT 
5351 TCCGATTCAA GTATCCGCCC CAGAATTTCC TTGCCAACAT AGATGTAAGC 
5401 GGAGAGGGCT GCTACCTTCT CACCAGCTGA GATCTCGGGA AAGCGACCGT 
5451 CGCCCAGCCT TTTCCAAGCT AACCCAGCGC TGCACTCGCC GGAGATTTGA 
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FIG. 


29c 




5501 


GTAATCCTCC 


TTCCCCTAGA 


AGCTTCACGT 


TCTGAGAAGG CATCGGAAAC 


5551 


CCTGCGGTCC 


AATTCATCAG 


CCTTATCGAC 


CATTATCAGA TCCACCATTT 


5601 


TGGTAGAGAA 


ATGATCGCAC 


TGGTTATCAA 


GGAGAGCTGA AGCTGACATg 


5651 


GtGgatcCCG 


GGTCGACtgt 


aattigtaaat 


agtaattgta atgttgttitg 


5701 


ttgtttgttg 


ttgttggtaa 


ttgttgtaaa 


aataAGCTGC AGATACGAAA 


5751 


ACATAAACTT 


GACGTGACAT 


TATATAGGGA 


GGTGGGGGTG GGGGGGGGGG 


5801 


GGTGTAGAAA 


AGaGTTTTTG 


CTGTTATGGa 


CAACTTTGTT CATACAAAAG 


5851 


ATGTGAaCTA 


TTCTTGTTGA 


ACGGGAATCC 


CATTTGAATA TAATTCGTGa 


5901 


GGGTTACTCA 


CCATTACGGA 


TAAAaGGGGT 


TGAATTATTA TTATCTTTTG 


5951 


GaCTTTGTTT 


ATAGTAATTT 


GaCCAATTTT 


AACAACGGCA AAGATCATAT 


6001 


TTAGCTCGCG 


GCTGAAAGTT 


TTTATATTCA 


CTAGCCGAAA AAGTGTATGA 


6051 


AATTTGTATA 


TTTCTTGTAT 


ACATACAAAA 


ATATATTTTT TCGGCTATTA 


6101 


TTTTGAGAGC 


AACTATACAT 


TATTATTTTT 


TCTTTCAAGA AACTTTATTT 


6151 


TTTGTCGGCC 


tGGAATTGGT 


GCACGTTTTC 


TTGGCCCGCG TTATGATTCA 


6201 


AAACTCTGGC 


CCGAACAATG 


GCGTATACAG 


AATTTTTCAC AAAACATATT 


6251 


ATTAACATAT 


TGTTATGCTA 


AAATCCTACA 


AGTTAGAGAT TTATATGATT 


6301 


TAGTAAGTTT 


AGAAACGTTA 


CCTAAAAGTT 


ATTTTGCATG TTTTGTTGGT 


6351 


TCTCTTAAAG 


GGAAAAGTAA 


AATTATTCGT 


AGATAAAATG AGAATTTTTT 


6401 


TTAGTTCGGT 


TCTTTTTAAC 


TAGAGAAAAT 


TGAGTCTTAG GTTCAAATAT 


6451 


GTGAAATTGA 


AGTACTTTGC 


TAAAGGGAAA 


AATAGCACAA AACTCGCTAT 


6501 


GACGTCAAGT 


ACATAAGTAT 


AAAGTACTTT 


GACTAAATAA AAAAGAAAAA 


6551 


TTGAAATTCA 


ATTTACAATT 


ATATTGTAAG 


AATATTAAAT AGTGGACCAC 


6601 


TTTATATACC 


CAAAGAATAA 


ACTTGGTTAT 


TACACTTGAA TGTATGCGAA 


6651 


GAGTTCAACA 


AAAGCTACAT 


GATACTAATA 


TCCGCAGGGC CTTCGTAGAA 


6701 


ATTATAGCGG 


GTAGTCACTT 


TTAAGCCTGT 


TATTTAAAAT GTATTTATAC 


6751 


TTTATAATAT 


ATTTAAAGAT 


TAACTAATTT 


ACTCAAATTT TAGGGCATGA 


6801 


CGTCCTAAAG 


TTTAAACCTC 


AAAGTTCAAA 


CTTCAGGACA TCGTGTCCTA 


6851 


AAGTTCGAAA 


ATCGTGTCCA 


GAAATTCGAA 


TCTTATGTCC TGAATCTTAA 
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FIG. 29d 

6 901 ATTAGCAGTT CAAAAATTTA GGACACTTAG TCCTGAATTT CAAATTAGCA 
6951 GCTCAAAAAT TCAGGATATA TATTTAGTCC TTAAGTTTAG ACGAATTGGC 

7 001 TAATCTTTAA ATACATTATA AATTATGAAT ACATTTTTTA TAGCAAACTT 
7051 AAAAGTGGCT ATTGGTGCAG TTCGCCCAGT ACCATCTCAT ATCAGCCCAA 
7101 TCATTACAAC CCTCCCATAC AAGAGTTCAA AGTTCAAACA TGCAAGCTAA 
7151 CACTATTATA ACAAAaGCCT CTCAAATATC TACATCAaGC TTGGCGTAAT 
7201 CATGGTCATA GCTGTTTCCT GTGTGAAATT GTTATCCGCT CACAATTCCA 
7251 CACAACATAC GAGCCGGAAG CATAAAGTGT AAAGCCTGGG GTGCCTAATG 

73 01 AGTGAGCTAA CTCACATTAA TTGCGTTGCG CTCACTGCCC GCTTTCCAGT 
7 351 CGGGAAACCT GTCGTGCCAG CTGCATTAAT GAATCGGCCA ACGCGCGGGG 
7401 AGAGGCGGTT TGCGTATTGG GCGCTATCGG GGGATCTATA GGGACTTTAC 

74 51 GTGATCTGGA TTTTAGTACT GGATTTTGGT TTTAGGAATT AGAAATTTTA 
.7501 TTGATAGAAG TATTTTACAA ATACAAATAC ATACTAAGGG TTTCTTATAT 
7551 GCTCAACACG TGAGCGAAAC CCTATAAGAA CCCTAATTCC CTTATCTGGG 
7601 AACTACTCAC ACATVTATTA TGGAGAAAAT AGAGAGAGAT AGATTTGTAG 
7651 AGAGAGACTG GTGATTTCGA ACCCCAGAGT CCCGCTCAGA AGAACTCGTC 
7701 AAGAAGGCGA TAGAAGGCGA TGCGCTGCGA ATCGGGAGCG GCGATACCGT 
7751 AAAGCACGAG GAAGCGGTCA GCCCATTCGC CGCCAAGCTC TTCAGCAATA 
7801 TCACGGGTAG CCAACGCTAT GTCCTGATAG CGGTCCGCCA CACCCAGCCG 
7851 GCCACAGTCG ATGAATCCAG AAAAGCGGCC ATTTTCCACC ATGATATTCG 
7901 GCAAGCAGGC ATCGCCATGG GTCACGACGA GATCCTCGCC GTCGGGCATG 
7951 CGCGCCTTGA GCCTGGCGAA CAGTTCGGCT GGCGCGAGCC CCTGATGCTC 
8001 TTCGTCCAGA TCATCCTGAT CGACAAGACC GGCTTCCATC CGAGTACGTG 
8051 CTCGCTCGAT GCGATGTTTC GCTTGGTGGT CGAATGGGCA GGTAGCCGGA 
8101 TCAAGCGTAT GCAGCCGCCG CATTGCATCA GCCATGATGG ATACTTTCTC 
8151 GGCAGGAGCA AGGTGAGATG ACAGGAGATC CTGCCCCGGC ACTTCGCCCA 
8201 ATAGCAGCCA GTCCCTTCCC GCTTCAGTGA CAACGTCGAG CACAGCTGCG 
8251 CAAGGAACGC CCGTCGTGGC CAGCCACGAT AGCCGCGCTG CCTCGTCCTG 
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FIG. 


29e 






8301 


CAGTTCATTC AGGGCACCGG 


ACAGGTCGGT 


CTTGACAAAA 


AGAACCGGGC 


8351 


GCCCCTGCGC TGACAGCCGG 


AACACGGCGG 


CATCAGAGCA 


GCCGATTGTC 


8401 


TGTTGTGCCC AGTCATAGCC 


GAATAGCCTC 


TCCACCCAAG 


CGGCCGGAGA 


8451 


ACCTGCGTGC AATCCATCTT 


GTTCAATCAT 


GCGAAAGGAT 


CCTCAGCCTG 


8501 


TCTCTTGATC TCCTCTCCAA 


ATGAAATGAA 


CTTCCTTATA 


TAGAGGAAGG 


8551 


GTCTTGCGAA GGATAGTGGG 


ATTGTGCGTC 


ATCCCTTACG 


TCAGTGGAGA 


8601 


TATCACATCx. ATCCACTTGC 


TTTGAAGACG 


TGGTTGGAAC 


GTCTTCTTTT 


8651 


TCCACGATGC TCCTCGTGGG 


TGGGGGTCCA 


TCTTTGGGAC 


CACTGTCGGC 


8701 


AGAGGCATCT TGAACGATAG 


CCTTTCCTTT 


ATCGCAATGA 


TGGCATTTGT 


8751 


AGGTGCCACC TTCCTTTTCT 


ACTGTCCTTT 


TGATGAAGTG 


ACAGATAGCT 


8801 


GGGCAATGGA ATCCGAGGAG 


GTTTCCCGAT 


ATTACCCTTT 


GTTGAAAAGT 


8851 


CTCAATAGCC CTTTGGTCTT 


CTGAGACTGT 


ATCTTTGATA 


TTCTTGGAGT 


8901 


AGACGAGAGT GTCGTGCTCC 


ACCATGTTGA 


CGAAGATTTT 


CTTCTTGTCA 


8951 


TTGAGTCGTA AAAGACTCTG 


TATGAACTGT 


TCGCCAGTCT 


TCACGGCGAG 


9001 


TTCTGTTAGA TCCTCGATCT 


GAATTTTTGA 


CTCCATGGCC 


TTTGATTCAG 


9051 


TAGGAACTAC TTTCTTAGAG 


ACTCCAATCT 


CTATTACTTG 


CCTTGGTTTA 


9101 


TGAAGCAAGC CTTGAATCGT 


CCATACTGGA 


ATAGTACTTC 


TGATCTTGAG 


9151 


AAATATATCT TTCTCTGTGT 


TCTTGATGCA 


GTTAGTCCTG 


AATCTTTTGA 


9201 


CTGCATCTTT AACCTTCTTG 


GGAAGGTATT 


TGATCTCCTG 


GAGATTATTA 


9251 


CTCGGGTAGA TCGTCTTGAT 


GAGACCTGCC 


GCGTAGGCCT 


CTCTAACCAT 


9301 


CTGTGGGTCA GCATTCTTTC 


TGAAATTGAA 


GAGGCTAATC 


TT CTCATTAT 


9351 


CGGTGGTGAA CATGGTATCG 


TCACCTTCTC 


CGTCGAACTT 


TCTTCCTAGA 


9401 


TCGTAGAGAT AGAGAAAGTC 


GTCCATGGTG 


ATCTCCGGGG 


CAAAGGAGAT 


9451 


CCTCTAGCTG ATAGTGACCT 


TAGGCGACTT 


TTGAACGCGC 


AATAATGGTT 


9501 


TCTGACGTAT GTGCTTAGCT 


CATTAAACTC 


CAGAAACCCG 


CGGCTGAGTG 


9551 


GCTCCTTCAA TCGTTGCGGT 


TCTGTCAGTT 


CCAAACGTAA 


AACGGCTTGT 


9601 


CCCCCGTCAT CGGCGGGGGT 


CATAACGTGA 


CTCCCTTAAT 


TCTCCGCTCA 


9651 


TCATCAGCGC CTGAGTGGCT 


' CCTTCAACGT TCCGGTTCTG 


TCAGTTCCAA 
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FIG. 29f 



97 01 ACGTAAAACG GCTTGTCCCG CGTCATCGGC GGGGGTCATA ACGTGACTCC 

9751 CTTAATTCTC CGCTCATGAT CAGATTGTCG TTTCCCGCCT TCAGTTTAAA 

9801 CTATCAGTGT TTGACAGGAT ATATTGGCGG GTAAACCTAA GAGAAAAGAG 

9851 CGTTTATTAG AATAATCGGA TATTTAAAAG GGCGTGAAAA GGTTTATCCG 

9901 TTCGTCCATT TGTATGTGCA TGCCAACCAC AGG 
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FIG. 30a 

^^^^ CTGAT GGGCTGCCTG TATCGAGTGG 

2751 TGATTTTGTG CCGAGCTGCC GGTCGGGGAG CTGTTGGCTG GCTGGTGGCA 

2801 GGATATATTG TGGTGTAAAC AAATTGACGC TTAGACAACT TAATAACACA 

2 851 TTGCGGACGT TTTTAATGTA CTGCCATTCG CCATTCAGGC TGCGCAACTG 
2901 TTGGGAAGGG CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA 
2951 AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC 

3 001 CAGTCACGAC GTTGTAAAAC GAcGGCCAGT GCGAATTCGC AAAACACACC 
3051 TAGACTAGAT TTGTTTTGCT AACCCAATTG ATATTAATTA TATATGATTA 
3101 ATATTTATAT GTATATGGAt TTGGTTAATg AAATGCATCt' GGTTCATCAA 
3151 AGAATTATAA AGACACGTGA CATTCATTTA GGATAAGAAA TATGGATGAT 
3 201 CTCTTTCTCt tTTATTCAGA TAAcTAGTAA TTACACATAA CACACAACTT 
3251 TGATGCCCAC ATTATAGTGA TTAGCATGTC ACTATGTGTG CATCCTTTTA 
3301 TTTCATACAT TAATTAAgTT GGCCAATCCA GAAGATGGAC AAGTCTAGGt 
3 351 TGGGTACCGA GCTCaCGTGG CATACAGCTA ATTAAAAGCA TCGGGAAAGG 
3 401 AAGGAAAATT TATGGCTTTT TCTAATGCTG CCCCGATTGC TAACAGACGG 
3 451 TGGTCTGACC CCGCTAATCC ATCAATTTCC ATTCCAACAG GCAAGCGATC 
3 501 AGGTGTAAGG CAGGCAGGAA GGCTCAACCC AGGTAGGCCT GCGTTGCTGC 
3551 TTGGGTCCAC ATTTCGCACG TAGATCTTGA AAGTGTTCAT CATTGAGCCA 
3601 TTGTGGATGA CTGACGACTC CTGACCTATG GCTTTGGCCG CTAAGGGTGC 

3 651 AGTTGGGAAA AGGATTGCAT CTAACTGATA GAGTCTGAAG TAATTCCGAT 
3701 AAGTGGCCTG GAGCCTTGGC CTGAAGGATT GACGCGCCAG TTCATATTCA 
3751 TCGTTGGAAA TTTGATGCCC ATCAATTTGC GCACTGACAA TGTTCGCTAC 
3801 ATCGGGGCTA CGtATaCCTT TGATAACGTC AGAAAAAGAA ACTGTTCCCA 
3851 CAAAATCGTC GAGATACTTT TTTAGAGCGT GTGGAAATTC GTAAAGCGCA 
3901 ATTGGCAAAC TTGCCCCACT ATTCAGTTCC TCTAGGTGGG GGATGTCGGC 
3951 TTCAACAAAG GTTACGCCTC TGTTGGCTAG CAAGCGAATC GTCGTTTCAC 

4 001 CTGCGAAGGC CACATCAGCA TCAAGGTCAT CGTAAAAGTA GGTAGTGGGG 
4 051 AGGCCGATCC GAAGCCCCTT CAGCGGCATG GGTGAAATTT TCGCCGACCG 
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FIG. 30b 

4101 TCCGGAAATC ACCTGGTCGA GGATTATAAC ATCGGCTACG CACTGCGCTA 
4151 TGATTCCGGC GGTGTCCCGG GTGGGGCTGA CCGGTATTAT CCGATCTCTT 
4201 GGATATCGAG CAAGCGTCGG TCGAAATCCT ACTACGCCAC ACAGGGCTGC 
4251 GGGTAGGCGA ACAGATGCAC CGGTATCGGT GCCTATGCCG CCTAACATCA 
4 3 01 ATCGGCTTGC CACCGCAGCA GCCACACCAC CGCTTGAGCC TCCTGGTATC 

43 51 AGACTTGGAT TCCACGGGTT CCGCACCGCA CCGGTGGCAT AGTTGTTGCT 

44 01 CGTAATTCGA AACGATAAGT CATGCATGTT TCCCGAGGCA CCGGGGAGTG 
4451 CTCCAGCTGA AAAAAGTCTT TCTGCGACGC GGGATGGTAT CTTTGGCAAG 
4501 TGGTTTATCA GCGCCGGAGT AGCAGCGCTT GTAGGAAATA TGCCGGTCGC 
4 551 GATGTTCGCC TTAAAACAGA GTGGAATGCC GCAAAGACCT AATCCGGCGT 
4 601 TTCCATGACG ATCAATTTTT TTGGCGCTTC GCCGCAAGCC ATCCCAGTCT 
4 651 GTAGCCAGAA GGGCATTTAA TGGTTTTGCA GCTTGGCAAC GCGCTATCAG 
4701 AGTTTCTACT AGTTCTAAGC AGGAGTAGTC TTTCCGTCTC AGGCGTTCTA 
4751 GGGTTTGTGC TAACGAGGTA ATGGGCACCA TggTGGATCC CGGGTCGACt 
4801 gtiaattgtiaa atagtaattg taatgttgtt tgttg^ttitgt: tgttgttggt: 
4851 aattgttgta aaaataAGCT GCAGATACGA AAACATAAAC TTGACGTGAC 
4901 ATTATATAGG GAGGTGGGGG TGGGGGGGGG GGGGTGTAGA AAAGaGTTTT 
4951 TGCTGTTATG GaCAACTTTG TTCATACAAA AGATGTGAaC TATTCTTGTT 
5001 GAACGGCAAT CCCATTTGAA TATAATTCGT GaGGGTTACT CACCATTACG 
5051 GATAAAaGGG GTTGAATTAT TATTATCTTT TGGaCTTTGT TTATAGTAAT 
5101 TTGaCCAATT TTAACAACGG CAAAGATCAT ATTTAGCTCG CGGCTGAAAG 
5151 TTTTTATATT CACTAGCCGA AAAAGTGTAT GAAATTTGTA TATTTCTTGT 
5201 ATACATACAA AAATATATTT TTTCGGCTAT TATTTTGAGA GCAACTATAC 
5251 ATTATTATTT .TTTCTTTCAA GAAACTTTAT TTTTTGTCGC CCtGGAATTG 
5301 GTGCACGTTT TCTTGGCCCG CGTTATGATT CAAAACTCTG GCCCGAACAA 
5351 TGGCGTATAC AGAATTTTTC ACAAAACATA TTATTAACAT ATTGTTATGC 
5401 TAAAATCCTA CAAGTTAGAG ATTTATATGA TTTAGTAAGT TTAGAAACGT 
54 51 TACCTAAAAG TTATTTTGCA TGTTTTGTTG GTTCTCTTAA AGGGAAAAGT 
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FIG. 30c 



5501 


AAAATTATTC GTAGATAAAA 


TGAGAATTTT 


TTTTAGTTCG 


GTTCTTTTTA 


5551 


ACTAGAGAAA ATTGAGTCTT 


AGGTTCAAAT 


ATGTGAAATT 


GAAGTACTTT 


5601 


GCTAAAGGGA AAAATAGCAC 


AAAACTCGCT ATGACGTCAA 


GTACATAAGT 


5651 


ATAAAGTACT TTGACTAAAT 


AAAAAAGAAA 


AATTGAAATT 


CAATTTACAA 


5701 


TTATATTGTA AGAATATTAA 


ATAGTGGACC 


ACTTTATATA 


CCCAAAGAAT 


5751 


AAACTTGGTT ATTACACTTG 


AATGTATGCG 


AAGAGTTCAA 


CAAAAGCTAC 


5801 


ATGATACTAA TATCCGCAGG 


GCCTTCGTAG 


AAATTATAGC„ 


GGGTAGTCAC 


5851 


TTTTAAGCCT GTTATTTAAA 


ATGTATTTAT 


ACTTTATAAT 


ATATTTAAAG 


5901 


ATTAACTAAT TTACTCAAAT 


TTTAGGGCAT 


GACGTCCTAA 


AGTTTAAACC 


5951 


TCAAAGTTCA AACTTCAGGA 


CATCGTGTCC 


TAAAGTTCGA 


AAATCGTGTC 


6001 


CAGAAATTCG AATCTTATGT 


CCTGAATCTT 


AAATTAGCAG 


TTCAAAAATT 


6051 


TAGGACACTT AGTCCTGAAT 


TTCAAATTAG 


CAGCTCAAAA 


ATTCAGGATA 


6101 


TATATTTAGT CCTTAAGTTT 


AGACGAATTG 


GCTAATCTTT 


AAATACATTA 


6151 


TAAATTATGA ATACATTTTT 


TATAGCAAAC 


TTAAAAGTGG 


CTATTGGTGC 


6201 


AGTTCGCCCA GTACCATCTC 


ATATCAGCCC 


AATCATTACA 


ACCCTCCCAT 


6251 


ACAAGAGTTC AAAGTTCAAA 


CATGCAAGCT 


AACACTATTA 


TAACAAAaGC 


6301 


CTCTCAAATA TCTACATCAa 


GCTTGGCGTA 


ATCATGGTCA 


TAGCTGTTTC 


6351 


CTGTGTGAAA TTGTTATCCG 


CTCACAATTC 


CACACAACAT 


ACGAGCCGGA 


6401 


AGCATAAAGT GTAAAGCCTG 


GGGTGCCTAA 


TGAGTGAGCT 


AACTCACATT 


6451 


AATTGCGTTG CGCTCACTGC 


CCGCTTTCCA 


GTCGGGAAAC 


CTGTCGTGCC 


6501 


AGCTGCATTA ATGAATCGGC 


CAACGCGCGG 


GGAGAGGCGG 


TTTGCGTATT 


6551 


GGGCGCTATC GGGGGATCTA 


TAGGGACTTT 


AGGTGATCTG 


GATITTAGTA 


o O U X 


UXV^Vjivi X 1 1 w X X X ±A\jK9JxPl 


TTAGAAATTT 


TATTGATAGA 


AGTAi,"X"l"i'AC 


6651 


AAATACAAAT ACATACTAAG 


GGTTTCTTAT 


ATGCTCAACA 


CGTGAGCGAA 


6701 


ACCCTATAAG AACCCTAATT 


CCCTTATCTG 


GGAACTACTC 


ACA.CATVTAT 


6751 


TATGGAGAAA ATAGAGAGAG 


ATAGATTTGT 


AGAGAGAGAC 


TGGTGATTTC 


6801 


GAACCCCAGA GTCCCGCTCA 


GAAGAACTCG 


TCAAGAAGGC 


GATAGAAGGC 


6851 


GATCCCCTGC GAATCGCGAG 


CGGCGATACC 


CTAAAGCACC 


ACGAACCGCT 
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FIG. 30d 

6901 CAGCCCATTC GCCGCCAAGC TCTTCAGCAA TATCACGGGT AGCCAACGCT 
6951 ATGTCCTGAT AGCGGTCCGC CACACCCAGC CGGCCACAGT CGATGAATCC 
7001 AGAAAAGCGG CCATTTTCCA CCATGATATT CGGCAAGCAG GCATCGCCAT 
7051 GGGTCACGAC GAGATCCTCG CCGTCGGGCA TGCGCGCCTT GAGCCTGGCG 
7101 AACAGTTCGG CTGGCGCGAG CCCCTGATGC TCTTCGTCCA GATCATCCTG 
7151 ATCGACAAGA CCGGCTTCCA TCCGAGTACG TGCTCGCTCG ATGCGATGTT 
7201 TCGCTTGGTG GTCGAATGGG CAGGTAGCCG GATCAAGCGT ATGCAGCCGC 
7251 CGCATTGCAT CAGCCATGAT GGATACTTTC TCGGCAGGAG CAAGGTGAGA 
73 01 TGACAGGAGA TCCTGCCCCG GCACTTCGCC CAATAGCAGC CAGTCCCTTC 

73 51 CCGCTTCAGT GACAACGTCG AGCACAGCTG CGCAAGGAAC GCCCGTCGTG 

74 01 GCCAGCCACG ATAGCCGCGC TGCCTCGTCC TGCAGTTCAT TCAGGGCACC 
74 51 GGACAGGTCG GTCTTGACAA AAAGAACCGG GCGCCCCTGC GCTGACAGCC 
7501 GGAACACGGC GGCATCAGAG CAGCCGATTG TCTGTTGTGC CCAGTCATAG 
7551 CCGAATAGCC TCTCCACCCA AGCGGCCGGA GAACCTGCGT GCAATCCATC 
7601 TTGTTCAATC ATGCGAAAGG ATCCTCAGCC TGTCTCTTGA TCTCCTCTCC 
7 651 AAATGAAATG AACTTCCTTA TATAGAGGAA GGGTCTTGCG AAGGATAGTG 
7701 GGATTGTGCG TCATCCCTTA CGTCAGTGGA GATATCACAT CAATCCACTT 
7751 GCTTTGAAGA CGTGGTTGGA ACGTCTTCTT TTTCCACGAT GCTCCTCGTG 
7 801 GGTGGGGGTC CATCTTTGGG ACCACTGTCG GCAGAGGCAT CTTGAACGAT 

7 851 AGCCTTTCCT TTATCGCAAT GATGGCATTT GTAGGTGCCA CCTTCCTTTT 
7901 CTACTGTCCT TTTGATGAAG TGACAGATAG CTGGGCAATG GAATCCGAGG 
7951 AGGTTTCCCG ATATTACCCT TTGTTGAAAA GTCTCAATAG CCCTTTGGTC 

8 001 TTCTGAGACT GTATCTTTGA TATTCTTGGA GTAGACGAGA GTGTCGTGCT 
8051 CCACCATGTT GACGAAGATT TTCTTCTTGT CATTGAGTCG TAAAAGACTC 
8101 TGTATGAACT GTTCGCCAGT CTTCACGGCG AGTTCTGTTA GATCCTCGAT 
8151 CTGAATTTTT GACTCCATGG CCTTTGATTC AGTAGGAACT ACTTTCTTAG 
8201 AGACTCCAAT CTCTATTACT TGCCTTGGTT TATGAAGCAA GCCTTGAATC 
8251 GTCCATACTG GAATAGTACT TCTGATCTTG AGAAATATAT CTTTCTCTGT 
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FIG. 30e 



8301 


GTTCTTGATG CAGTTAGTCC 


TGAATCTTTT 


GACTGCATCT 


TTAACCTTCT 


8351 


TGGGAAGGTA TTTGATCTCC TGGAGATTAT TACTCGGGTA 




8401 


ATGAGACCTG CCGCGTAGGC 


CTCTCTAACC 


ATCTGTGGGT 




8451 


TCTGAAATTG AAGAGGCTAA TCTTCTCATT 






8501 


CGTCACCTTC TCCGTCGAAC 


TTTCTTCCTA 


GATCGTAGAG 




8551 


TCGTCCATGG TGATCTCCGG 


GGCAAAGGAG 


ATCCTCTAGC 


TGATAGTGAC 


8601 


CTTAGGCGAC TTTTGAACGC 


GCAATAATGG 


TTTCTGACGT 


ATGTGCTTAG 


8651 


CTCATTAAAC TCCAGAAACC 


CGCGGCTGAG 


TGGCTCCTTC 


AATCGTTGCG 


8701 


GTTCTGTCAG TTCCAAACGT 


AAAACGGCTT 


GTCCCGCGTC 


ATCGGCGGGG 


8751 


GTCATAACGT GACTCCCTTA 


ATTCTCCGCT 


CATGATCAGC 


GGCTGAGTGG 


8801 


CTCCTTCAAC GTTGCGGTTC 


TGTCAGTTCC 


AAACGTAAAA 


CGGCTTGTCC 


8851 


CGCGTCATCG GCGGGGGTCA 


TAACGTGACT 


CCCTTAATTC 


TCCGCTCATG 


8901 


ATCAGATTGT CGTTTCCCGC 


CTT CAGTTTA 


AACTATCAGT 


GTTTGACAGG 


8951 


ATATATTGGC GGGTAAACCT 


AAGAGAAAAG 


AGCGTTTATT 


AGAATAATCG 


9001 


GATATTTAAA AGGGCGTGAA 


AAGGTTTATC 


CGTTCGTCCA 


TTTGTATGTG 


9051 


CATGCCAACC ACAGG 
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